Preface


The rapid development of computing places special demands on the training of 
young specialists in this field. The complexity of the problems that must be 
solved to satisfy the needs of science, technology, industry and the economy also 
grows, simultaneously with the rapid growth in the power of modern computer 
systems. In this connection, we believe that it is essential that computing specialists 
have fundamental knowledge about the development of the related mathematical 
frameworks and about how to create methods for solving the above problems. 

Algebra and geometry are important areas whose ideas and results are
actively used in the development of information systems, as well as in software
developed for business projects. The basic notions of algebra include numerical matrices
and matrix algorithms, which are used extensively
in scientific and technical problems and in the game industry. The rapid development
of game technologies, as well as augmented and alternative reality technologies,
means that special attention must be paid to university courses in analytical
geometry and linear algebra, pattern properties in 3D space and fast algorithms for
working with two- and three-dimensional objects.

Another promising area of application for linear algebra algorithms that has seen
rapid development in recent years is Big Data. The analysis of extremely large arrays
requires not only knowledge and use of the known methods, but also poses the
challenge of developing new approaches and high-performance algorithms.

This textbook is an introduction to linear algebra and analytical geometry for 
higher-education students in the natural sciences. It is based on the courses Algebra 
and Geometry, Analytical Geometry and Fundamental and Computer Algebra, 
which are taught to first-year students of the Faculty of Computer Sciences at 
Voronezh State University. The textbook is intended for theoretical training, as
a supplement to existing textbooks, for practical and laboratory classes, and also
for self-study. Going forward, the terms “Algebra” and “Linear Algebra” will be
considered equivalent, as will “Geometry” and “Analytical Geometry”.

The authors have attempted to present the material in the most comprehensible
form while not sacrificing strictness in definitions and theorems. The statements 
(theorems, properties) are accompanied with proofs, or references to specialist 
literature for advanced study of the materials. 

The fundamentals of algebra and geometry are presented in the form most 
suitable for future specialists in computing. We have considered the basic algorithms 
for working with matrices, vectors and systems of linear equations. The theoretical 
material contains solutions of most types of problems and is supplemented with 
plenty of analysed examples. The end of an example is designated by the symbol 
. Each chapter ends with problems for self-study. Many of them are provided not 
only with full answers but also with detailed solutions. The asterisk sign (*) marks 
the advanced (enhanced complexity) problems. 

Apart from the sections traditionally included in algebra and geometry courses,
one of the chapters is devoted to the mathematical fundamentals of a modern
branch of cryptography, namely elliptic curve cryptography. This
chapter serves as a connecting link between the mathematical courses and the methods
applied in practice by application software developers.

The section on quantum computing is devoted to one example of the
application of algebra. It demonstrates that the notions of linear algebra are used for
constructing new algorithms whose computational capability considerably exceeds
that of existing ones.

Let us briefly summarize the content of this textbook. The first four chapters 
are devoted to classical divisions of linear algebra; they consider matrices and 
determinants, and systems of linear equations; definitions are given for the notion of 
vector space and the fundamental solution of a homogeneous system. The next few 
chapters introduce the fundamentals of vector algebra and the coordinate method 
on a plane and in a 3D space. The following subjects are considered: vectors in 
three-dimensional space, the equation of a line on a plane, the equation of a plane 
in space and the equation of a line in space. Second-order curves are analysed. 
Material on elliptic curves is usually not included in a “traditional” algebra and 
geometry course. However, its presence in this book, in our opinion, contributes to a 
deeper understanding of the methods of linear algebra and analytical geometry and 
provides an example of the implementation of such methods for solving problems 
in theoretical and practical cryptography. 

We use the Python programming language to illustrate the algorithms
considered. This allows us to familiarize readers with implementation at the initial
stage of study. Python was selected because it is a widely used general-purpose
programming language suitable for implementing numerical
algorithms, it is continuously evolving, and many of its implementations
are open source. Python has tools for automatically checking for
errors that might appear in the program code during its creation. The
availability of a great number of additional libraries (such as NumPy, SciPy, pandas)
substantially expands the programmer’s capabilities. Thus, this language is well
suited to teaching linear algebra and analytical geometry algorithms.

At the time of writing, the version of Python known as “Python 2” is still being
used in many significant projects and in the literature. However, official support for 
Python 2 is diminishing and is scheduled to end. So, we use the latest major version, 
“Python 3”, in this book. Note that there are significant differences between Python 
2 and Python 3; however, extended support documentation and tools are available 
for conversion between the two major versions. Refer to the official Python webpage 
(https://www.python.org/) for more details. 

The book offers a list of training literature on linear algebra and analytical 
geometry, which may be used for a more detailed study on the issues touched upon 
in this textbook. 

The appendices contain reference information, including basic operators in 
Python and C, trigonometric formulae and the Greek alphabet. These reduce the 
necessity to address reference literature. 

Below you can see the chart of the chapter information dependence in the form of 
an oriented graph reflecting the preferable order of covering the academic material. 
For instance, after having studied Chaps. 1, 2 and 3, you can move to one of the two 
chapters, Chap. 4 or Chap. 6, the contents of which are relatively independent. After 
Chap. 9, we think Chaps. 11 and 12 can be mastered in any order. 


The set of natural numbers 

The set of integers 

The set of rational numbers 

The set of real numbers 

The set of complex numbers 

n-dimensional vector space 

The empty set 

Logical consequence, or implication 

Logical equivalence 

For all x, the statement P(x) is true 

There exists such x that the statement P(x) is true 
Conjunction of logical expressions A and B 
Disjunction of logical expressions A and B 
Equivalency 

The set consisting of the elements $a_1, a_2, \dots, a_n$


The sum $a_1 + a_2 + \dots + a_n$


The product $a_1 a_2 \dots a_n$

Zero matrix

Kronecker delta 

Commutator of the matrices A and B 

Trace of the matrix A 

D is a directed graph with vertex set V and edge set E

Out-degree of vertex v in a digraph

In-degree of vertex v in a digraph 

Floor function of x, i.e. the greatest integer less than or equal
to the real number x (see definition on page 78)
Additional minor of the matrix element placed at the intersection
of the i-th row and the j-th column

Cofactor of the element $a_{ij}$

Inverse of the matrix A 

Minor of the k-th order (see page 59) 

Rank of the matrix A 

Exponential of the matrix A 

Logarithm of the matrix A 

Imaginary unit 

Complex number conjugate of the complex number z 
Modulus of the complex number z 

Argument of the complex number 

Hermitian conjugate matrix 

Quantum state 

Basic quantum states of the qubit 

Pauli matrices 

Vector of the n-dimensional space $\mathbb{R}^n$

Zero vector 

Euclidean norm of the vector x 

General solution of a homogeneous system of linear equations 
Specific solution of a non-homogeneous system of linear 
equations 

Projection of the vector a onto the line L (see page 256) 
Normalized vectors of the Cartesian coordinate system 
Orthogonality of the vectors a and b 

Scalar or inner product of vectors 

Vectorial or outer product of vectors 

Scalar triple product 

Vector triple product 

Absolute value of the real number x 

Sign of the real number x 

Normalizing factor (see pages 285 and 311) 

Deviation of a point from a line or a plane 

Bilinear form 

Quadratic form 

Eccentricity of a curve of the second order 

Elliptic curve with real points 

Elliptic curve with rational points 

Point at infinity of an elliptic curve 

The sum of two points A and B on an elliptic curve 


Chapter 1
Matrices and Matrix Algorithms


1.1 Matrices and Operations with Them 


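A matrix of size $m \times n$ is a rectangular table of numbers with $m$ rows and $n$ columns.
A matrix is written in the form

\[
A = \begin{bmatrix}
a_{11} & a_{12} & \dots & a_{1n} \\
a_{21} & a_{22} & \dots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \dots & a_{mn}
\end{bmatrix}. \tag{1.1}
\]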


Matrices are usually denoted by capital Latin letters, for example, $A$, $B$, $U$, ...
The numbers $a_{ij}$ included in the matrix are its elements. An ordered set of
elements $a_{i1}, a_{i2}, \dots, a_{in}$ of the matrix $A$ with the same first index $i$ is referred
to as the $i$-th row of the matrix, while an ordered set of elements $a_{1j}, a_{2j}, \dots, a_{mj}$
with the same second index $j$ is referred to as the $j$-th column. Thus, the first
index of an arbitrary element $a_{ij}$ indicates the row number, while the second index
indicates the column number, at the intersection of which this element is situated.
A brief matrix notation is widely used:


\[
A = (a_{ij}), \quad i = 1, 2, \dots, m; \quad j = 1, 2, \dots, n. \tag{1.2}
\]


A column of $n$ numbers is also called an $n$-vector, or simply a vector. Thus, a 1-vector
represents a single number, or, in other words, a scalar.


Note Matrices were initially introduced for a compact record of linear equations. 
Now, they are used in various divisions of mathematics and physics and their 
applications for simpler presentation of various mathematical operations on matrix 
elements. 


Example 1.1 A point on a computer screen in RGB format is represented in the form
of a 3-vector with components

\[
p = \begin{bmatrix} p_R \\ p_G \\ p_B \end{bmatrix}, \tag{1.3}
\]

where $p_R$, $p_G$, $p_B$ are real numbers from the interval $[0, 1]$ that characterize the
intensity of the red, green and blue colour components, respectively. Various combinations
of the component values allow any colour to be obtained. In particular, the vectors

\[
p_1 = \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} \quad \text{and} \quad
p_2 = \begin{bmatrix} 0.2 \\ 0.2 \\ 0.6 \end{bmatrix} \tag{1.4}
\]

determine the red and dark-blue colours, respectively.


If the condition $m = n$ is met, then the matrix is called a square matrix of order $n$.
If the number of rows is not equal to the number of columns, and thus the inequality
$m \neq n$ is met, then such a matrix is a rectangular one.


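Note The following notations are also used for matrices:

\[
\begin{pmatrix}
a_{11} & a_{12} & \dots & a_{1n} \\
a_{21} & a_{22} & \dots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \dots & a_{mn}
\end{pmatrix}
\quad \text{or} \quad
\begin{Vmatrix}
a_{11} & a_{12} & \dots & a_{1n} \\
a_{21} & a_{22} & \dots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \dots & a_{mn}
\end{Vmatrix}. \tag{1.5}
\]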


The elements of real matrices are real numbers from the set $\mathbb{R} = (-\infty, \infty)$,
while the elements of complex matrices are complex numbers.


Note In standard mathematical notation, the indices of the elements begin with
one: $i, j = 1, 2, \dots$ In many programming languages, including Python and C,
rows and columns are numbered from zero to $m - 1$ and $n - 1$, respectively.
This difference must be kept in mind when implementing matrix algorithms in
these languages.
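
The index shift described in this note can be illustrated by a minimal Python sketch. The helper name `element` is hypothetical and is introduced here only for illustration; it assumes that the matrix is stored as a list of rows.

```python
def element(matrix, i, j):
    """Return a_ij for 1-based mathematical indices i, j (hypothetical helper)."""
    return matrix[i - 1][j - 1]


A = [[5, 0, -4],
     [2, -1, 3]]          # a matrix of size 2 x 3

m, n = len(A), len(A[0])  # number of rows and number of columns

print(element(A, 1, 1))   # a_11 is stored at A[0][0]
print(element(A, m, n))   # a_mn is stored at A[m - 1][n - 1]
```

With such a helper, formulas written with indices starting from one can be transcribed literally, at the cost of one subtraction per access.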

For the matrix $A$ we will build a new matrix $B$ by interchanging the rows and
the columns:

\[
B = \begin{bmatrix}
a_{11} & a_{21} & \dots & a_{m1} \\
a_{12} & a_{22} & \dots & a_{m2} \\
\vdots & \vdots & \ddots & \vdots \\
a_{1n} & a_{2n} & \dots & a_{mn}
\end{bmatrix}. \tag{1.6}
\]

Such a matrix $B$ is called transposed with respect to $A$ and is denoted by $A^T$. As is
easy to see, applying the transposition operation twice returns the initial matrix:
$(A^T)^T = A$.

Example 1.2 The matrix transposed with respect to the matrix
$A = \begin{bmatrix} 5 & 0 & -4 \\ 2 & -1 & 3 \end{bmatrix}$ is the matrix
$A^T = \begin{bmatrix} 5 & 2 \\ 0 & -1 \\ -4 & 3 \end{bmatrix}$.

Let $A$ be a square matrix. Its main diagonal is the set of elements
$a_{11}, a_{22}, \dots, a_{nn}$ with equal row and column indices, and its secondary diagonal, or
cross-diagonal, is the set of elements $a_{n1}, a_{(n-1)2}, \dots, a_{1n}$ of the matrix.

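A square matrix is called diagonal if all of its elements located outside the main
diagonal are equal to zero:

\[
\begin{bmatrix}
d_1 & 0 & \dots & 0 \\
0 & d_2 & \dots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \dots & d_n
\end{bmatrix}. \tag{1.7}
\]

If in a diagonal matrix of the form (1.7) the equalities $d_i = 1$ hold for all values
$i = 1, 2, \dots, n$, then the matrix is called the identity matrix, or unit matrix, and is
denoted by $I$, while if all the elements $d_i = 0$, then it is called the zero matrix,
or null matrix, and is denoted by $O$:

\[
I = \begin{bmatrix}
1 & 0 & \dots & 0 \\
0 & 1 & \dots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \dots & 1
\end{bmatrix}, \quad
O = \begin{bmatrix}
0 & 0 & \dots & 0 \\
0 & 0 & \dots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \dots & 0
\end{bmatrix}. \tag{1.8}
\]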

The Kronecker¹ symbol is used to denote the elements of an identity matrix. It is
defined as follows:

\[
\delta_{ij} = \begin{cases} 1, & \text{if } i = j, \\ 0, & \text{if } i \neq j. \end{cases} \tag{1.9}
\]

Thus, in symbolic notation, we have $I = (\delta_{ij})$, where $i, j = 1, 2, \dots, n$.


Note Often, in the notation of the Kronecker symbol, the indices are separated by
commas: $\delta_{i,j}$.
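
The definition (1.9) translates directly into code. The following sketch uses the illustrative names `kronecker_delta` and `identity`, which are our own, and builds the identity matrix as $I = (\delta_{ij})$.

```python
def kronecker_delta(i, j):
    """Kronecker symbol: 1 if the indices coincide, 0 otherwise."""
    return 1 if i == j else 0


def identity(n):
    """Identity matrix of order n built element by element from the Kronecker symbol."""
    return [[kronecker_delta(i, j) for j in range(n)] for i in range(n)]


print(identity(3))   # [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
```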


The matrix $A = (a_{ij})$ is called upper triangular if $a_{ij} = 0$ for $i > j$, i.e. all
the elements positioned below the main diagonal are equal to zero. Similarly, the
matrix $B = (b_{ij})$ is called lower triangular if $b_{ij} = 0$ for $i < j$, i.e. all the elements
above the main diagonal are equal to zero.

Upper and lower triangular matrices may schematically be denoted as shown in 
Fig. 1.1. 

A square matrix $A = (a_{ij})$ is called symmetric if $a_{ij} = a_{ji}$ for all values
$i, j = 1, 2, \dots, n$; in other words, all the elements symmetric with respect to the
main diagonal are equal to each other.

Taking into account the notion of a transposed matrix, the symmetry condition may
be written in the form of the equality $A = A^T$.

For an antisymmetric matrix, the elements satisfy $a_{ij} = -a_{ji}$, where
$i, j = 1, 2, \dots, n$.
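
The definitions of triangular, symmetric and antisymmetric matrices can be checked element by element. The sketch below assumes square matrices stored as lists of rows; the function names and the sample matrices `U`, `S` and `K` are our own and serve only as an illustration.

```python
def is_upper_triangular(a):
    """All elements below the main diagonal (i > j) are zero."""
    n = len(a)
    return all(a[i][j] == 0 for i in range(n) for j in range(n) if i > j)


def is_lower_triangular(a):
    """All elements above the main diagonal (i < j) are zero."""
    n = len(a)
    return all(a[i][j] == 0 for i in range(n) for j in range(n) if i < j)


def is_symmetric(a):
    """a_ij == a_ji for all i, j."""
    n = len(a)
    return all(a[i][j] == a[j][i] for i in range(n) for j in range(n))


def is_antisymmetric(a):
    """a_ij == -a_ji for all i, j (so the diagonal elements are zero)."""
    n = len(a)
    return all(a[i][j] == -a[j][i] for i in range(n) for j in range(n))


U = [[1, 2, 3], [0, 4, 5], [0, 0, 6]]        # upper triangular
S = [[1, 7, 0], [7, 2, -3], [0, -3, 5]]      # symmetric
K = [[0, 2, -1], [-2, 0, 4], [1, -4, 0]]     # antisymmetric

print(is_upper_triangular(U), is_lower_triangular(U))   # True False
print(is_symmetric(S), is_antisymmetric(K))             # True True
```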

Let us turn to the notion of equality of matrices. Two matrices $A = (a_{ij})$ and
$B = (b_{ij})$ of size $m \times n$ are equal to each other if and only if $a_{ij} = b_{ij}$ for all $i$ and
$j$. Thus, only matrices of the same size can be equal.


Fig. 1.1 Schematic notation for upper A and lower B triangular matrices. Highlighted is the 
position of the elements other than zero 


¹ Leopold Kronecker (1823-1891), German mathematician.

Example 1.3 Consider two matrices C and D:


1 cc dead 
C= , D= , (1.10) 
—c* ct d da 


where c and d are some real numbers. 
Equality of matrices C = D is equivalent to the system of equations reflecting 
equality of separate elements: 


a= 1, 
c= —d, 
(1.11) 
—c? d, 
c= dd’. 
Then, the matrices C and D are equal if and only if the equalities c = +1 and 


d = —1 are met. 


Example 1.4 A binary matrix, or (0, 1)-matrix, is a matrix whose elements
take the values 0 or 1. Let us calculate how many binary matrices of size $m \times n$ exist.
Each element of such a matrix may only take two values. Since a matrix
consisting of $m$ rows and $n$ columns has a total of $mn$ elements, we obtain
$2^{mn}$ ways to assign values to the elements. Hence, the number of binary matrices of
size $m \times n$ is $2^{mn}$.
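
For small sizes, the result of Example 1.4 can be confirmed by direct enumeration. The following sketch uses `itertools.product` from the standard library; the generator name `binary_matrices` is our own.

```python
from itertools import product


def binary_matrices(m, n):
    """Generate all binary matrices of size m x n as lists of rows."""
    for bits in product((0, 1), repeat=m * n):
        # cut the flat tuple of m*n bits into m rows of length n
        yield [list(bits[i * n:(i + 1) * n]) for i in range(m)]


m, n = 2, 3
count = sum(1 for _ in binary_matrices(m, n))
print(count, 2 ** (m * n))   # 64 64
```

Since the number of matrices grows as $2^{mn}$, such an enumeration is feasible only for very small $m$ and $n$.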


Consider the basic operations on matrices. Operations on matrices are introduced 
using the well-known arithmetic operations on their elements. Addition and multi- 
plication of real numbers are naturally transferred to the matrices and form the basic 
operations of matrix algebra. 

The sum of two matrices $A = (a_{ij})$ and $B = (b_{ij})$ of the same size $m \times n$ is the
matrix $C = (c_{ij})$ of the same size, consisting of the elements $c_{ij} = a_{ij} + b_{ij}$. The
sum of matrices is written as $C = A + B$.

Note that only matrices of the same size, square or rectangular, can be added.


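Example 1.5 Given two matrices $A$ and $B$:

\[
A = \begin{bmatrix} 2 & 0 & -1 \\ 1 & 3 & 4 \end{bmatrix}, \quad
B = \begin{bmatrix} 0 & 5 & 3 \\ 2 & 1 & 4 \end{bmatrix}. \tag{1.12}
\]

Find their sum $A + B$ by adding the respective elements:

\[
A + B = \begin{bmatrix} 2 + 0 & 0 + 5 & -1 + 3 \\ 1 + 2 & 3 + 1 & 4 + 4 \end{bmatrix}
= \begin{bmatrix} 2 & 5 & 2 \\ 3 & 4 & 8 \end{bmatrix}. \tag{1.13}
\]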

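The product $\alpha A$ of the real number $\alpha$ and the matrix $A = (a_{ij})$ is the matrix
$C = (c_{ij})$ consisting of the elements $c_{ij} = \alpha \cdot a_{ij}$.


Example 1.6 Assume that the real numbers $\alpha = 2$, $\beta = -3$ and the
matrix $A = \begin{bmatrix} 0 & -1 & 2 \\ -2 & 3 & 4 \end{bmatrix}$ are given. Then

\[
\alpha A = 2A = \begin{bmatrix} 0 & -2 & 4 \\ -4 & 6 & 8 \end{bmatrix}, \quad
\beta A = (-3)A = \begin{bmatrix} 0 & 3 & -6 \\ 6 & -9 & -12 \end{bmatrix}.
\]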


Based on the introduced operations, we can define the difference of matrices:
$A - B = A + (-1)B$. Thus, the difference of matrices is
nothing but the sum of the first matrix and the second matrix multiplied by
the number $(-1)$.

Note that for an antisymmetric matrix $A$ the equality $A^T = -A$ is true.


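Example 1.7 Find the difference of the matrices defined in Example 1.5:

\[
A - B = \begin{bmatrix} 2 & 0 & -1 \\ 1 & 3 & 4 \end{bmatrix}
- \begin{bmatrix} 0 & 5 & 3 \\ 2 & 1 & 4 \end{bmatrix}
= \begin{bmatrix} 2 & 0 & -1 \\ 1 & 3 & 4 \end{bmatrix}
+ (-1) \begin{bmatrix} 0 & 5 & 3 \\ 2 & 1 & 4 \end{bmatrix}
\]
\[
= \begin{bmatrix} 2 + (-1) \cdot 0 & 0 + (-1) \cdot 5 & -1 + (-1) \cdot 3 \\
1 + (-1) \cdot 2 & 3 + (-1) \cdot 1 & 4 + (-1) \cdot 4 \end{bmatrix}
= \begin{bmatrix} 2 & -5 & -4 \\ -1 & 2 & 0 \end{bmatrix}.
\]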
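
The three operations introduced so far (sum, product by a number, difference) can be sketched in Python as follows. The function names `mat_add`, `scalar_mul` and `mat_sub` are our own; the data reproduce Examples 1.5, 1.6 and 1.7.

```python
def mat_add(a, b):
    """Element-wise sum of two matrices of the same size."""
    if len(a) != len(b) or len(a[0]) != len(b[0]):
        raise ValueError("matrices must have the same size")
    return [[a[i][j] + b[i][j] for j in range(len(a[0]))] for i in range(len(a))]


def scalar_mul(alpha, a):
    """Product of the number alpha and the matrix a."""
    return [[alpha * x for x in row] for row in a]


def mat_sub(a, b):
    """Difference defined as A - B = A + (-1)B."""
    return mat_add(a, scalar_mul(-1, b))


A = [[2, 0, -1], [1, 3, 4]]
B = [[0, 5, 3], [2, 1, 4]]

print(mat_add(A, B))                              # [[2, 5, 2], [3, 4, 8]]
print(mat_sub(A, B))                              # [[2, -5, -4], [-1, 2, 0]]
print(scalar_mul(2, [[0, -1, 2], [-2, 3, 4]]))    # [[0, -2, 4], [-4, 6, 8]]
```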


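The introduced operations have the following properties, which are true for arbitrary
matrices $A$, $B$ and $C$ of the same size and all $\lambda, \mu \in \mathbb{R}$:


1. $A + B = B + A$ (commutativity of addition);

2. $(A + B) + C = A + (B + C)$ (associativity of addition);

3. $\lambda(\mu A) = (\lambda\mu)A$;

4. $\lambda(A + B) = \lambda A + \lambda B$;

5. $(\lambda + \mu)A = \lambda A + \mu A$;

6. $A + O = O + A = A$.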

The primary operation of linear algebra is the product of matrices. Given
two initial matrices, it constructs a new matrix.
In order to introduce this notion, let us first consider one special case. The product
of a row of $n$ elements by a column of $n$ elements is the number equal to the sum
of the products of the respective elements of the row and the column:

\[
\begin{bmatrix} a_1 & a_2 & \dots & a_n \end{bmatrix}
\begin{bmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{bmatrix}
= a_1 b_1 + a_2 b_2 + \dots + a_n b_n. \tag{1.14}
\]


Example 1.8 Calculate the product of the row $[1, 2, 4, 8, 16]$ by the column
$[16, 8, 4, 2, 1]^T$:

\[
\begin{bmatrix} 1 & 2 & 4 & 8 & 16 \end{bmatrix}
\begin{bmatrix} 16 \\ 8 \\ 4 \\ 2 \\ 1 \end{bmatrix}
= 1 \cdot 16 + 2 \cdot 8 + 4 \cdot 4 + 8 \cdot 2 + 16 \cdot 1 = 80. \tag{1.15}
\]


Now let us consider the general case of matrices of an arbitrary size. 

The product of the matrix $A = (a_{ij})$ of size $m \times n$ and the matrix $B = (b_{ij})$ of
size $n \times p$ is the matrix $C = (c_{ij})$ of size $m \times p$ whose elements are expressed in
accordance with the rule

\[
c_{ij} = \sum_{k=1}^{n} a_{ik} b_{kj}. \tag{1.16}
\]

The product of matrices is written as $C = A \cdot B$ or $C = AB$.

Thus, the element $c_{ij}$ of the matrix $C = AB$ is the sum of the products of the
elements of the $i$-th row of the matrix $A$ by the respective elements of the $j$-th
column of the matrix $B$ (Fig. 1.2).

Fig. 1.2 Multiplication of matrices $(a_{ij})$ and $(b_{ij})$: the element $c_{ij} = \sum_{k=1}^{n} a_{ik} b_{kj}$ is formed from the $i$-th row of the first matrix and the $j$-th column of the second

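Example 1.9 Execute the operation of multiplication of the matrices
$\begin{bmatrix} 1 & 2 \\ -3 & 4 \end{bmatrix}$ and
$\begin{bmatrix} -3 & 6 \\ 5 & -4 \end{bmatrix}$:

\[
\begin{bmatrix} 1 & 2 \\ -3 & 4 \end{bmatrix}
\begin{bmatrix} -3 & 6 \\ 5 & -4 \end{bmatrix}
= \begin{bmatrix} 1 \cdot (-3) + 2 \cdot 5 & 1 \cdot 6 + 2 \cdot (-4) \\
(-3) \cdot (-3) + 4 \cdot 5 & (-3) \cdot 6 + 4 \cdot (-4) \end{bmatrix}
= \begin{bmatrix} 7 & -2 \\ 29 & -34 \end{bmatrix}. \tag{1.17}
\]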
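
The rule (1.16) corresponds to three nested loops over the indices $i$, $j$ and $k$. The following is a minimal sketch for matrices stored as lists of rows; the function name `mat_mul` is our own, and the data are those of Example 1.9.

```python
def mat_mul(a, b):
    """Product of an m x n matrix a and an n x p matrix b by rule (1.16)."""
    m, n, p = len(a), len(b), len(b[0])
    if len(a[0]) != n:
        raise ValueError("columns of the first factor must equal rows of the second")
    c = [[0] * p for _ in range(m)]
    for i in range(m):              # row of the result
        for j in range(p):          # column of the result
            for k in range(n):      # summation index
                c[i][j] += a[i][k] * b[k][j]
    return c


A = [[1, 2], [-3, 4]]
B = [[-3, 6], [5, -4]]
print(mat_mul(A, B))   # [[7, -2], [29, -34]]
```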


Note The definition of the product of matrices introduced above looks less natural
than the definition of the sum. However, it is precisely this way of introducing the
operation of multiplication that preserves in matrix algebra many properties
typical of the product of real numbers.


The following properties are met: 


1. A(B+C) = AB+AC, (B+C)A = BA+CA (distributivity of multiplication 
with respect to addition); 

2. (AB)C = A(BC) (associativity of multiplication); 

3. OA = AO = O (property of zero matrix); 

4. $IA = AI = A$ (property of identity matrix).
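
In the general case, the order of the factors in a product of matrices is essential, as
demonstrated by the following example.


Example 1.10 Let $A = \begin{bmatrix} 2 & -1 \\ 1 & 0 \end{bmatrix}$ and
$B = \begin{bmatrix} 3 & 0 \\ 1 & -1 \end{bmatrix}$.
Then we have

\[
AB = \begin{bmatrix} 2 & -1 \\ 1 & 0 \end{bmatrix}
\begin{bmatrix} 3 & 0 \\ 1 & -1 \end{bmatrix}
= \begin{bmatrix} 2 \cdot 3 + (-1) \cdot 1 & 2 \cdot 0 + (-1) \cdot (-1) \\
1 \cdot 3 + 0 \cdot 1 & 1 \cdot 0 + 0 \cdot (-1) \end{bmatrix}
= \begin{bmatrix} 5 & 1 \\ 3 & 0 \end{bmatrix}; \tag{1.18}
\]

at the same time, the product of the matrices taken in the other order is equal to

\[
BA = \begin{bmatrix} 3 & 0 \\ 1 & -1 \end{bmatrix}
\begin{bmatrix} 2 & -1 \\ 1 & 0 \end{bmatrix}
= \begin{bmatrix} 3 \cdot 2 + 0 \cdot 1 & 3 \cdot (-1) + 0 \cdot 0 \\
1 \cdot 2 + (-1) \cdot 1 & 1 \cdot (-1) + (-1) \cdot 0 \end{bmatrix}
= \begin{bmatrix} 6 & -3 \\ 1 & -1 \end{bmatrix}. \tag{1.19}
\]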

So, matrix multiplication is non-commutative, i.e. when the factors are
permuted, the result may change.

As follows directly from the definition of the matrix product, two matrices can be multiplied
if and only if the number of columns of the first factor, the matrix $A$, coincides
with the number of rows of the second factor, the matrix $B$. We should also note
that the existence of the product $AB$ does not imply the existence of the product
$BA$.
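
Both observations can be checked with a short Python sketch. The function `mat_mul` is our own compact implementation of the matrix product; the matrices `A` and `B` are those of Example 1.10, while `P` and `Q` are sample rectangular matrices chosen only for illustration.

```python
def mat_mul(a, b):
    """Matrix product; raises ValueError if the sizes are incompatible."""
    if len(a[0]) != len(b):
        raise ValueError("columns of the first factor must equal rows of the second")
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


A = [[2, -1], [1, 0]]
B = [[3, 0], [1, -1]]
print(mat_mul(A, B))   # [[5, 1], [3, 0]]
print(mat_mul(B, A))   # [[6, -3], [1, -1]]

P = [[1, 2, 3]]                  # size 1 x 3
Q = [[1, 0], [0, 1], [2, 2]]     # size 3 x 2
print(mat_mul(P, Q))             # [[7, 8]], the product PQ exists

try:
    mat_mul(Q, P)                # 3 x 2 by 1 x 3: sizes are incompatible
    qp_exists = True
except ValueError:
    qp_exists = False
print(qp_exists)                 # False
```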


Matrix Commutator and Matrix Trace 
Matrices $A$ and $B$ are called commuting (or permutable) if $AB = BA$.
Commuting matrices are necessarily square and have the same order.

The commutator of two square matrices of the same order is the matrix

\[
[A, B] = AB - BA. \tag{1.20}
\]

By definition, for commuting matrices the condition $[A, B] = O$ is met.


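Example 1.11 Calculate $[A, B]$, if

\[
A = \begin{bmatrix} -1 & 2 & -2 \\ 2 & 1 & -1 \\ -1 & -1 & -1 \end{bmatrix}, \quad
B = \begin{bmatrix} 1 & 0 & -1 \\ -1 & 1 & 1 \\ 2 & 0 & 0 \end{bmatrix}. \tag{1.21}
\]

Solution

\[
AB = \begin{bmatrix} -1 & 2 & -2 \\ 2 & 1 & -1 \\ -1 & -1 & -1 \end{bmatrix}
\begin{bmatrix} 1 & 0 & -1 \\ -1 & 1 & 1 \\ 2 & 0 & 0 \end{bmatrix}
= \begin{bmatrix} -7 & 2 & 3 \\ -1 & 1 & -1 \\ -2 & -1 & 0 \end{bmatrix}, \tag{1.22}
\]
\[
BA = \begin{bmatrix} 1 & 0 & -1 \\ -1 & 1 & 1 \\ 2 & 0 & 0 \end{bmatrix}
\begin{bmatrix} -1 & 2 & -2 \\ 2 & 1 & -1 \\ -1 & -1 & -1 \end{bmatrix}
= \begin{bmatrix} 0 & 3 & -1 \\ 2 & -2 & 0 \\ -2 & 4 & -4 \end{bmatrix}, \tag{1.23}
\]
\[
[A, B] = AB - BA = \begin{bmatrix} -7 & -1 & 4 \\ -3 & 3 & -1 \\ 0 & -5 & 4 \end{bmatrix}. \tag{1.24}
\]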
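
The computation of Example 1.11 can be reproduced by a short sketch. The function names `mat_mul` and `commutator` are our own; square matrices of the same order are assumed.

```python
def mat_mul(a, b):
    """Product of two square matrices of the same order."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator(a, b):
    """Commutator [A, B] = AB - BA of two square matrices of the same order."""
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


A = [[-1, 2, -2], [2, 1, -1], [-1, -1, -1]]
B = [[1, 0, -1], [-1, 1, 1], [2, 0, 0]]

print(mat_mul(A, B))      # [[-7, 2, 3], [-1, 1, -1], [-2, -1, 0]]
print(mat_mul(B, A))      # [[0, 3, -1], [2, -2, 0], [-2, 4, -4]]
print(commutator(A, B))   # [[-7, -1, 4], [-3, 3, -1], [0, -5, 4]]
```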


Example 1.12 Prove the Jacobi² identity, which is true for the commutators of any matrices of
size $n \times n$:

\[
[[P, Q], R] + [[Q, R], P] + [[R, P], Q] = O. \tag{1.25}
\]

Proof Use the definition of the commutator $[P, Q] = PQ - QP$; then

\[
[[P, Q], R] = [PQ - QP, R] = (PQ - QP)R - R(PQ - QP)
= PQR - QPR - RPQ + RQP. \tag{1.26}
\]

Then, in a similar manner, we will present the remaining summands in the sum:

\[
[[Q, R], P] = QRP - RQP - PQR + PRQ, \tag{1.27}
\]
\[
[[R, P], Q] = RPQ - PRQ - QRP + QPR. \tag{1.28}
\]

The sum of the values (1.26), (1.27) and (1.28), as is easy to see after cancelling
like terms, is equal to zero. Thus, the Jacobi identity is proved.
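
A numerical check does not replace the proof, but it can help to catch sign errors. The sketch below evaluates the left-hand side of (1.25) for three integer matrices generated pseudo-randomly; the function names, the fixed seed and the range of the entries are arbitrary choices made for illustration.

```python
import random


def mat_mul(a, b):
    """Product of two square matrices of the same order."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_add(a, b):
    """Element-wise sum of two square matrices of the same order."""
    n = len(a)
    return [[a[i][j] + b[i][j] for j in range(n)] for i in range(n)]


def commutator(a, b):
    """Commutator [A, B] = AB - BA."""
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


rng = random.Random(0)   # fixed seed, chosen arbitrarily
n = 3
P, Q, R = ([[rng.randint(-9, 9) for _ in range(n)] for _ in range(n)]
           for _ in range(3))

total = mat_add(mat_add(commutator(commutator(P, Q), R),
                        commutator(commutator(Q, R), P)),
                commutator(commutator(R, P), Q))
print(total)   # [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
```

Integer entries are used so that the arithmetic is exact and the result is exactly the zero matrix.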


² Carl Gustav Jacob Jacobi (1804-1851), German mathematician.

The trace $\operatorname{tr} A$ of the square matrix $A = (a_{ij})$, where $1 \le i, j \le n$, is the sum of its
diagonal elements:

\[
\operatorname{tr} A = \sum_{i=1}^{n} a_{ii}. \tag{1.29}
\]

Another designation of the trace of the matrix $A$ is $\operatorname{Sp} A$, from the German word “Spur”.


Example 1.13 The trace of the identity matrix $I$ of size $n \times n$ is equal to its order:
$\operatorname{tr} I = n$.
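
Formula (1.29) is a single sum over the diagonal. A minimal sketch, with a function name `trace` of our own choosing, confirms the result of Example 1.13 for $n = 4$ and computes the trace of one more sample matrix.

```python
def trace(a):
    """Sum of the diagonal elements of a square matrix."""
    if any(len(row) != len(a) for row in a):
        raise ValueError("the trace is defined for square matrices only")
    return sum(a[i][i] for i in range(len(a)))


n = 4
I = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
print(trace(I))                                          # 4
print(trace([[-1, 2, -2], [2, 1, -1], [-1, -1, -1]]))    # -1
```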


Estimate of the Number of Multiplication Operations When Multiplying 
Matrices 
In order to estimate the working time of the computing algorithms it is necessary to 
know the number of the multiplication operations executed in the program. Let us 
determine this number for the matrix multiplication operation. 

Let both factor matrices be square and have the same order $n$. Then $AB$
is a matrix of size $n \times n$. To calculate all the elements of the result we will need $n^2$
multiplications of a row by a column. Each multiplication of a row by a column contains
exactly $n$ multiplications of real numbers. Hence, in order to determine the product
$AB$ we will need $n^3$ real multiplications.
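
This estimate can be confirmed by counting the multiplications directly. The sketch below instruments the elementary triple-loop algorithm with a counter; the function name `mat_mul_counted` and the test matrices filled with ones are our own choices.

```python
def mat_mul_counted(a, b):
    """Multiply two square matrices and count the real multiplications."""
    n = len(a)
    c = [[0] * n for _ in range(n)]
    count = 0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                c[i][j] += a[i][k] * b[k][j]
                count += 1          # one multiplication of real numbers
    return c, count


for n in (1, 2, 5, 10):
    ones = [[1] * n for _ in range(n)]
    _, count = mat_mul_counted(ones, ones)
    print(n, count, n ** 3)         # the last two numbers coincide
```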


Note There exist non-elementary algorithms that allow matrices to be multiplied in a
smaller number of operations. Among the best known of such algorithms is
the Strassen³ algorithm. Note that the advantages of using the Strassen algorithm and
similar non-elementary methods of matrix multiplication become apparent only
for sufficiently large matrix sizes [20].


Modern scientific and technical tasks, game industry projects, and technologies
of augmented and alternative reality require fast execution of matrix operations on
large volumes of data. This is why matrix operations such as transposition, multiplication
and others are now executed using parallel programming methods. Working
with matrices on high-performance parallel systems has its own peculiarities associated
with the methods of data representation in computer memory and the methods
of interprocessor communication. The works [5, 27, 56] present basic algorithms of matrix
algebra adapted for application on high-performance computing
systems. Examples of the implementation of such algorithms are provided in [42].


Note As was noted above, the elements of a matrix are real or complex numbers. Apart
from this, the elements may also be functions on which algebraic operations can
be performed. In such a case we speak of functional matrices. In what follows, unless
otherwise specified, only numerical matrices are considered.


³ Volker Strassen (born 1936), German mathematician.

1.2 Concept of Algorithm, Correctness of Algorithms


Section 1.4 presents algorithms for working with matrices in the Python
language. For this reason, we first consider the concept of an
algorithm and of algorithm correctness and show how to estimate algorithm efficiency.

An algorithm is an exact prescription defining the computational process leading
from the varying source data to the result sought (the data is an ordered set of
symbols) [48, 49]. In other words, an algorithm describes a certain computational
procedure with the help of which a computational problem is solved. As a rule,
an algorithm is used for solving some class of problems rather than one particular
problem [16, 69]. The term “algorithm” derives from the name of the medieval
mathematician al-Khwarizmi.⁴

The concept of an algorithm is one of the fundamental notions of mathematics.
Researchers use various definitions of an algorithm that differ from each other.
However, all definitions express or imply the following properties of an algorithm [48,
49].


1. Discreteness. An algorithm must represent a process of problem solving as a 
sequential execution of separate steps. Execution of each algorithm step takes 
some time, and each operation is only executed wholly and cannot be executed 
partly. 

2. Elementary character of steps. The method of execution of each command 
should be known and simple enough. 

3. Determinateness (from Latin determinare, to determine). Each successive step of
the algorithm operation is uniquely determined. The result should be the same
for the same source data.

4. Directedness. It should be known what to consider as the algorithm operation 
result. 

5. Mass character. There must be a possibility to apply the algorithm to all 
collections of source data from the certain pre-fixed set. 


Correctness of Algorithms 

Consider the algorithm A that solves a certain computational problem. Applying
this algorithm in a computer program requires justification that
the problem is solved correctly for all input data, i.e. we should carry out the proof of the
correctness of the algorithm A. For this, we need to trace all changes of the variables’
values that occur as a result of the algorithm’s operation. From the mathematical
point of view, we are talking about establishing the truth values of some predicates
describing the variables.


⁴ al-Khwarizmi (Muhammad ibn Musa al-Khwarizmi) (about 780 to about 850), a distinguished
mathematician, astronomer, geographer and philosopher. The term “algebra” derives from the
name of his work containing the general techniques for solving problems reduced to several
algebraic equations [9].

Assume that $P$ is a predicate that is true for the input data of the algorithm $A$, and $Q$ is a
predicate that takes a true value after the completion of $A$. The introduced predicates are
called the precondition and the postcondition, respectively.

The proposition $\{P\}A\{Q\}$ means the following: “if the operation of the algorithm $A$
starts from the true value of the predicate $P$, then it will end at the true value of
$Q$”. We obtain that the proof of correctness of the algorithm $A$ is equivalent to
the proof of the truth of $\{P\}A\{Q\}$. The pre- and postcondition together with the
algorithm itself are referred to as the Hoare⁵ triple. The Hoare triple describes how
the execution of the given fragment of the computer program changes the state of
computation [59].


Example 1.14 Let us prove the correctness of the algorithm for exchanging the
values of two variables.
Listing 1.1


# Exchanging the values of variables a and b
temp = a
a = b
b = temp


Proof Let the variables a and b take the following values: $a = a_0$, $b = b_0$.
Precondition: $P = \{a = a_0, b = b_0\}$, postcondition: $Q = \{a = b_0, b = a_0\}$.
Substitute the values of the variables a and b into the body of the algorithm A,
which will result in the following values: temp $= a_0$, $a = b_0$, $b = a_0$. This is
why the proposition $\{P\}A\{Q\}$ is true, and thus the correctness of the
algorithm swap(a, b) is proved.
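
The Hoare triple of Example 1.14 can also be written into the program itself by means of `assert` statements. In the sketch below, the initial values `a0` and `b0` are arbitrary and are chosen only for illustration; the first assertion encodes the precondition $P$, the second one the postcondition $Q$.

```python
a0, b0 = 3, 7                  # arbitrary initial values (illustrative)
a, b = a0, b0

assert a == a0 and b == b0     # precondition P

# Exchanging the values of variables a and b
temp = a
a = b
b = temp

assert a == b0 and b == a0     # postcondition Q
print(a, b)                    # 7 3
```

Such run-time checks confirm the triple only for the particular input data, whereas the proof above covers all values. Note also that in Python the same exchange is usually written as the tuple assignment `a, b = b, a`.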


1.3 Estimation of Algorithm Efficiency 


An important task of the algorithm analysis is the estimation of the number 
of operations executed by the algorithm over a certain class of input data. The 
exact number of elementary operations does not play any significant role here, 
since it depends on the software implementation of the algorithm, the computer’s 
architecture and other factors. This is why the algorithm’s performance indicator is 
the growth rate of this value with the growth of the input data volume [16, 51]. 

In order to analyse the algorithm efficiency, it is necessary to estimate the 
running time of the computer that solves the set problem, as well as the volume of 
memory used. The estimate of the running time of the computing system is usually 
obtained by calculating the number of elementary operations performed during 
computations (such operations are called basic operations). With the supposition
that one elementary operation is performed in a strictly defined time, the function
$f(n)$, defined as the number of operations during computations on input data of size
$n$, is called a time-complexity function [51].


⁵ Charles Antony Richard Hoare (born 1934), English scientist specializing in computer science.

In algorithm analysis, the number of basic operations is estimated, and it is
assumed that the execution of each of the operations listed below takes constant time
that does not depend on $n$ [52].


1. Binary arithmetic operations ($+$, $-$, $*$, $/$) and operations of comparison of real
numbers ($<$, $\le$, $>$, $\ge$, $=$, $\ne$).

2. Logic operations (and, or). 

3. Branching operations. 

4. Calculation of the values of elementary functions for relatively small values of 
their arguments. 


During implementation of matrix algorithms, in most cases the basic operation
is taken to be the multiplication of two real numbers.

Let us consider the functions $f, g\colon \mathbb{N} \to (0, \infty)$. Assume that $g(n)$ describes the
time complexity of a known algorithm.

It is said that a function $f(n)$ belongs to the class $O(g(n))$ (read as “big O of
$g$”) if the growth rate of $f(n)$ does not exceed the growth rate of $g(n)$. We give a
strict definition: $f(n) = O(g(n))$ if, for all values of the argument $n$ starting from
a threshold value $n = n_0$, the inequality $f(n) \le c\,g(n)$ is valid for some positive $c$:

\[
O(g(n)) = \{ f(n) \colon \exists\, c > 0,\ n_0 \in \mathbb{N} \text{ such that for all } n \ge n_0 \
f(n) \le c\,g(n) \text{ is valid} \}. \tag{1.30}
\]

The notation $f(n) \in O(g(n))$ can be read as “the function $g$ majorizes the
function $f$”.

Since $O(g(n))$ denotes a set of functions growing no faster than the function
$g(n)$, the notation $f(n) \in O(g(n))$ is used to indicate that a function belongs to this set.
Another notation is rather common in the literature:
$f(n) = O(g(n))$, where the equals sign is understood conventionally, namely in
the sense of belonging to the set. The class $O(g(n))$ is also referred to as the “big
O notation”.


Example 1.15 Prove that the asymptotic estimate $3n^3 \in O(n^4)$ is true.


Proof According to the definition (1.30) we should prove that there exists a positive
constant c such that, starting from some number $n_0$, the inequality $3n^3 \le cn^4$ is met,
or $(cn - 3)n^3 \ge 0$.

Assume that c = 3; then, starting from $n_0 = 1$, the last inequality is true. Therefore,
$3n^3 \in O(n^4)$.
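
The choice of constants in the proof can be checked numerically. The sketch below is only an illustration under assumed names (bound_holds, n_max); testing a finite range of n supports the inequality but does not replace the proof.

```python
def bound_holds(c, n0, n_max=1000):
    """Check that 3*n**3 <= c*n**4 for every integer n in [n0, n_max]."""
    return all(3 * n**3 <= c * n**4 for n in range(n0, n_max + 1))


print(bound_holds(3, 1))  # True: c = 3 works from n0 = 1, as in the proof
print(bound_holds(1, 1))  # False: with c = 1 the inequality fails at n = 1
print(bound_holds(1, 3))  # True: c = 1 works once n0 is raised to 3
```

The last two calls show that the pair (c, n_0) is not unique: a smaller constant can be compensated by a larger threshold.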


Note The notation O(f(t)) is used not only for t → ∞, but may also be generalized
to the case of an arbitrary limit value of the argument $t \to t_0$. For example, the
expression


\[ f(t) = O(g(t)) \quad \text{at } t \to t_0 \tag{1.31} \]

means that the limit of the ratio of the functions f(t) and g(t) is taken at the
point $t = t_0$:


\[ \lim_{t \to t_0} \frac{f(t)}{g(t)} = \text{const} > 0. \tag{1.32} \]


1.4 Primitive Matrix Operations in Python 


In Python programs, matrices are represented as two-dimensional arrays [62].
Python uses the special term “list” for such arrays. A list is an ordered sequence
of numbers or of other objects that can be represented in the computing system’s
memory. Thus, a matrix is specified as a list whose elements are lists of the same
length. In particular, the matrix


\[ A = \begin{bmatrix} 11 & 13 & 15 & 17 \\ -9 & -8 & -7 & -6 \\ -1 & -2 & 12 & 14 \end{bmatrix} \tag{1.33} \]


in a Python program will be presented as

    A = [[11, 13, 15, 17], [-9, -8, -7, -6], [-1, -2, 12, 14]]

As is seen, a list is formed by enumerating its elements separated by commas.
Matrix elements are addressed with square brackets, for example, A[i][j].

Note that the indices of arrays in Python begin from zero rather than one. For
example, for the matrix (1.33) we have the following equalities:

    A[0][0] = 11

    A[2][1] = -2
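
As a small illustration of zero-based addressing, the sketch below reads elements of the matrix (1.33) stored as a list of lists. The variable names are assumptions made for the example. It also shows that a comma-separated index pair is not accepted by a plain list (that form belongs to NumPy arrays).

```python
A = [[11, 13, 15, 17], [-9, -8, -7, -6], [-1, -2, 12, 14]]

rows, cols = len(A), len(A[0])     # number of rows and columns
top_left = A[0][0]                 # first row, first column
elem = A[2][1]                     # third row, second column
print(rows, cols, top_left, elem)  # 3 4 11 -2

try:
    A[2, 1]                        # a tuple index raises TypeError for a list
    tuple_index_ok = True
except TypeError:
    tuple_index_ok = False
print(tuple_index_ok)              # False
```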


Note The convention that indices start from zero is also used in programming
languages such as C and Java [39]. In Fortran and Pascal, however, the indices
begin from one by default [12, 74].

Let us show program code that reads the matrix elements from the console and
outputs the matrix to the console (see Listing 1.2).
Listing 1.2 


    def read_matrix_from_console():
        n = int(input())  # Number of rows
        m = int(input())  # Number of columns
        A = []

        for i in range(n):
            row = input().split()

            for j in range(m):
                row[j] = int(row[j])

            A.append(row)

        return A


    def print_matrix_to_console(A):
        for row in A:
            for elem in row:
                print(elem, end=' ')

            print()


Example of calling the functions read_matrix_from_console() and
print_matrix_to_console():

    A = read_matrix_from_console()
    print_matrix_to_console(A)


The following functions presented in Listing 1.3 perform standard operations on 
matrices: addition, multiplication by a number and transposition. 


Listing 1.3

    def matrix_add(A, B):
        # The sum is defined only for matrices of equal size
        if len(A) == len(B) and \
                len(A[0]) == len(B[0]):
            C = [[0 for j in range(len(A[0]))] \
                 for i in range(len(A))]

            for i in range(len(A)):
                for j in range(len(A[0])):
                    C[i][j] = A[i][j] + B[i][j]

            return C


    def matrix_mult_by_scalar(A, alpha):
        C = [[0 for j in range(len(A[0]))] \
             for i in range(len(A))]

        for i in range(len(A)):
            for j in range(len(A[0])):
                C[i][j] = alpha * A[i][j]

        return C


    def matrix_subtract(A, B):
        # The difference is defined only for matrices of equal size
        if len(A) == len(B) and \
                len(A[0]) == len(B[0]):
            C = [[0 for j in range(len(A[0]))] \
                 for i in range(len(A))]

            for i in range(len(A)):
                for j in range(len(A[0])):
                    C[i][j] = A[i][j] - B[i][j]

            return C


    def matrix_transpose(A):
        C = [[0 for j in range(len(A))] \
             for i in range(len(A[0]))]

        for i in range(len(A)):
            for j in range(len(A[0])):
                C[j][i] = A[i][j]

        return C
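
For comparison, the same three operations can be sketched more compactly with list comprehensions. This is an alternative formulation, not the code of Listing 1.3; the names add, scale and transpose are assumptions made for the illustration, and the test matrices are those of Problem 1.1.

```python
def add(A, B):
    """Element-wise sum of two matrices of equal size."""
    return [[a + b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def scale(A, alpha):
    """Multiplication of every element of A by the number alpha."""
    return [[alpha * a for a in row] for row in A]


def transpose(A):
    """Rows of the result are the columns of A."""
    return [list(col) for col in zip(*A)]


A = [[2, 1, -1], [0, 1, 4]]
B = [[-2, 1, 0], [-3, 2, 2]]
print(add(scale(A, 3), scale(B, 2)))  # [[2, 5, -3], [-6, 7, 16]]
print(transpose(A))                   # [[2, 0], [1, 1], [-1, 4]]
```

Note that zip silently truncates to the shorter argument, so this sketch, unlike the listing, performs no size check.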


An important function calculating the product of matrices by formula (1.16) is 
presented in Listing 1.4. 
Listing 1.4

    # Multiplication of matrices A and B
    def matrix_mult(A, B):
        C = [[0 for j in range(len(B[0]))] \
             for i in range(len(A))]

        for i in range(len(A)):
            for j in range(len(B[0])):
                s = 0

                for k in range(len(B)):
                    s += A[i][k] * B[k][j]

                C[i][j] = s

        return C
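
The triple loop of the product can be instrumented to count the basic operations. The sketch below is a self-contained variant with an assumed name matrix_mult_count; it returns both the product and the number of multiplications, which for an m × n matrix times an n × p matrix equals m·n·p. The test matrices are those of Problem 1.2.

```python
def matrix_mult_count(A, B):
    """Product of A (m x n) and B (n x p) and the number of multiplications."""
    m, n, p = len(A), len(B), len(B[0])
    C = [[0] * p for _ in range(m)]
    mults = 0
    for i in range(m):
        for j in range(p):
            s = 0
            for k in range(n):
                s += A[i][k] * B[k][j]
                mults += 1          # one basic operation per inner step
            C[i][j] = s
    return C, mults


A = [[1, -1, 0], [2, 3, 4]]
B = [[1, 1], [2, -1], [3, 0]]
C, mults = matrix_mult_count(A, B)
print(C)      # [[-1, 2], [20, -1]]
print(mults)  # 12
```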
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Table 1.1 Matrix functions and NumPy procedures 


Name                         Comment
dot(A, B)                    The product of matrices A and B
trace(A)                     Trace of matrix
linalg.inv(A)                Inversion of matrix
linalg.det(A)                Determinant of matrix
linalg.matrix_power(A, n)    Raising matrix A to power n
linalg.eigvals(A)            Calculation of eigenvalues of matrix
linalg.eig(A)                Solution of problems on eigenvalues and eigenvectors;
                             the function returns all solutions (λ, X) of the system
                             AX = λX
linalg.solve(A, B)           Solution of the system of linear equations AX = B with
                             vector B on its right side


1.4.1 NumPy Library 


For high-performance calculations, the open-source library NumPy is
widely used [46, 57]. This package introduces the data type array for representing
matrices in memory. In addition, when NumPy is included using the
command

    from numpy import *

a great number of matrix functions and procedures become available. The most
important ones are listed in Table 1.1.
In particular, transposition of arbitrary rectangular matrices is performed using
the attribute “.T”:

    A = array([[11, 13, 15, 17], [-9, -8, -7, -6], [-1, -2, 12, 14]])
    A.T

To the console (more specifically, into the standard output stream) will be sent

    array([[11, -9, -1],
           [13, -8, -2],
           [15, -7, 12],
           [17, -6, 14]])
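
Several entries of Table 1.1 can be tried on a small system. The sketch below assumes that NumPy is installed and uses the common alias import numpy as np rather than the wildcard import; the 2 × 2 matrix and the right-hand side are arbitrary illustrative data.

```python
import numpy as np

A = np.array([[2.0, 1.0], [1.0, 3.0]])
b = np.array([3.0, 5.0])

square = np.dot(A, A)                 # matrix product A * A
power = np.linalg.matrix_power(A, 2)  # the same result via matrix_power
tr = np.trace(A)                      # sum of the diagonal elements
det = np.linalg.det(A)                # determinant (floating-point value)
x = np.linalg.solve(A, b)             # solution of the system A x = b
inv = np.linalg.inv(A)                # inverse matrix

print(np.allclose(square, power))         # True
print(np.allclose(np.dot(A, x), b))       # True
print(np.allclose(np.dot(A, inv), np.eye(2)))  # True
```

Results of det, inv and solve are floating-point approximations, so they are compared with np.allclose rather than with exact equality.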


1.5 Matrix Algorithms in the Graph Theory 


As an example of an algorithm operating on matrices, let us consider one of the
important algorithms of graph theory, the Warshall⁶ algorithm [1, 61], which is
used for calculating the reachability matrix of a specified directed graph D(V, E).


6Stephen Warshall (1935–2006), American researcher in the field of computer science.
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First we will recall the basic notions of the graph theory. Everywhere below, the 
multiplication operation signs “·” and “×” will be considered as equivalent. In some
cases, when it is clear that we are dealing with multiplication, they may be omitted.

A graph is a pair G = (V, E), where V is the set of vertices and E is the set
of edges connecting some pairs of vertices [22, 31, 55, 73]. In directed graphs,
each edge is an ordered pair of vertices, i.e. it matters which vertex is the
beginning of the edge and which one is the end. Directed graphs are also referred to
as digraphs.

A drawing where the graph vertices are shown as points and the edges are shown as
segments or arcs is called a graph diagram.

Two vertices u and v of the graph are adjacent if they are connected by the
edge r = uv. In this case it is said that the vertices u and v are the endpoints of the
edge r. If the vertex v is an endpoint of the edge r, then v and r are considered to
be incident (from Latin incédere, to distribute).

The number of elements (cardinality) of any set, for example V, is denoted as 
|V|. 

Adjacency matrix M is a binary matrix of a relation over the set of vertices of
the graph G(V, E), which is specified by its edges. The adjacency matrix has the size
|V| × |V|, and its elements are determined in accordance with the rule


\[ M(i, j) = \begin{cases} 1, & \text{if edge } ij \in E, \\ 0, & \text{if edge } ij \notin E. \end{cases} \tag{1.34} \]

A path of length k in the graph G is a sequence of vertices $v_0, v_1, \ldots, v_k$ such
that for all i = 1, …, k the vertices $v_{i-1}$ and $v_i$ are adjacent. Trivial paths
of the form $v_i, v_i$ are also considered. For undirected graphs, paths are also called routes.
The length of a path is the number of edges in it, with repeated edges counted
each time they occur.


Example 1.16 Consider a digraph D(V, E), the set of vertices V and the set of 
edges E of which are specified as follows: 


V = {a, b, c, d, e}, E = {ab, ae, bc, bd, dc, de, ec}.


The graph D(V, E) is presented in Fig. 1.3.
The adjacency matrix M of the digraph D has the form:


\[
M = \begin{array}{c|ccccc}
  & a & b & c & d & e \\ \hline
a & 0 & 1 & 0 & 0 & 1 \\
b & 0 & 0 & 1 & 1 & 0 \\
c & 0 & 0 & 0 & 0 & 0 \\
d & 0 & 0 & 1 & 0 & 1 \\
e & 0 & 0 & 1 & 0 & 0
\end{array}
\tag{1.35}
\]

Fig. 1.3 The digraph D(V, E) for Example 1.16
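
The rule (1.34) translates directly into code. The sketch below builds the adjacency matrix of the digraph of Example 1.16 from its vertex and edge lists; the function name adjacency_matrix and the encoding of edges as two-letter strings are assumptions made for the illustration.

```python
def adjacency_matrix(vertices, edges):
    """Build the adjacency matrix of a digraph from its vertex and edge lists."""
    index = {v: i for i, v in enumerate(vertices)}  # vertex name -> row/column number
    M = [[0] * len(vertices) for _ in vertices]
    for u, v in edges:                              # each edge is an ordered pair (u, v)
        M[index[u]][index[v]] = 1
    return M


V = ['a', 'b', 'c', 'd', 'e']
E = ['ab', 'ae', 'bc', 'bd', 'dc', 'de', 'ec']      # a two-letter string unpacks into (u, v)
M = adjacency_matrix(V, E)
for row in M:
    print(row)
```

Because the edges are ordered pairs, the matrix is in general not symmetric: here M has a 1 in row a, column b, but a 0 in row b, column a.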


The reachability matrix M* of the digraph D(V, E) is the matrix of the transitive
closure of the relation E. The reachability matrix stores the information about the
existence of paths between the digraph vertices: the entry at the intersection of the
i-th row and the j-th column is 1 if and only if there exists a path from the
vertex $v_i$ to $v_j$. M* may be calculated by a formula using the logical operation
or [29]


\[ M^* = M \text{ or } M^2 \text{ or } \ldots \text{ or } M^n, \tag{1.36} \]


where n is the number of vertices of the directed graph, i.e. n = |V|. Note that
determining the elements of the matrix M* by formula (1.36) involves
a considerable volume of calculations; this is why, for digraphs with a great
number of vertices, the Warshall algorithm is used, also known as the
Roy⁷–Warshall algorithm [61].
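
Formula (1.36) can be evaluated directly with Boolean matrix products. The sketch below is one possible reading of the formula, with assumed names bool_mult and reachability_by_powers; it performs n − 1 Boolean products of n × n matrices, i.e. on the order of n⁴ elementary steps, which is the cost the text refers to. The test data is the digraph of Example 1.16.

```python
def bool_mult(A, B):
    """Boolean product: entry (i, j) is 1 if A[i][k] and B[k][j] hold for some k."""
    n = len(A)
    return [[int(any(A[i][k] and B[k][j] for k in range(n)))
             for j in range(n)] for i in range(n)]


def reachability_by_powers(M):
    """Compute M or M^2 or ... or M^n for a square binary matrix M."""
    n = len(M)
    R = [row[:] for row in M]      # accumulated disjunction of powers
    P = [row[:] for row in M]      # current power of M
    for _ in range(n - 1):
        P = bool_mult(P, M)
        R = [[R[i][j] or P[i][j] for j in range(n)] for i in range(n)]
    return R


M = [[0, 1, 0, 0, 1],
     [0, 0, 1, 1, 0],
     [0, 0, 0, 0, 0],
     [0, 0, 1, 0, 1],
     [0, 0, 1, 0, 0]]
for row in reachability_by_powers(M):
    print(row)
```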

The Warshall algorithm is based on the formation of a sequence of auxiliary binary
matrices $W^{(0)}, W^{(1)}, \ldots, W^{(n)}$, where n = |V|. The first matrix is set equal to the
adjacency matrix M of the digraph. The elements $W^{(k)}_{ij}$, where 1 ≤ i, j, k ≤ n, are
calculated by the rule: $W^{(k)}_{ij} = 1$ if there exists a path connecting the vertices $v_i$
and $v_j$ such that all the inner vertices belong to the set $V_k = \{v_1, v_2, \ldots, v_k\}$, and
$W^{(k)}_{ij} = 0$ otherwise. Note that an inner vertex of the path $P = v_i, \ldots, v_l, \ldots, v_j$
is any vertex $v_l$, 1 ≤ l ≤ n, belonging to P, except the first vertex $v_i$ and the last vertex $v_j$. The
resulting matrix $W^{(n)}$ is equal to M*, since $M^*_{ij} = 1$ if and
only if there exists a path $v_i, \ldots, v_j$, all inner vertices of which are contained
in $V = \{v_1, v_2, \ldots, v_n\}$.

The key point is that the matrix $W^{(k)}$ can be obtained from $W^{(k-1)}$ as
follows. A path $v_i, \ldots, v_j$ containing inner vertices only from the set $V_k$
exists if and only if one of the following conditions is fulfilled:


1. there exists a path $v_i, \ldots, v_j$ with inner vertices only from
$V_{k-1} = \{v_1, v_2, \ldots, v_{k-1}\}$;

2. there are paths $v_i, \ldots, v_k$ and $v_k, \ldots, v_j$, also containing inner vertices only
from $V_{k-1}$.


7Bernard Roy (born 1934), French mathematician.


We obtain two cases: either $W^{(k-1)}_{ij} = 1$, or, once $v_k$ is included into the set of
vertices allowed at this stage, $W^{(k-1)}_{ik} = 1$ and $W^{(k-1)}_{kj} = 1$. Therefore, using the logical
operations or (disjunction) and and (conjunction), we may write

\[ W^{(k)}_{ij} = W^{(k-1)}_{ij} \text{ or } \bigl( W^{(k-1)}_{ik} \text{ and } W^{(k-1)}_{kj} \bigr). \tag{1.37} \]


Let us show the respective algorithm for constructing M* from the specified
adjacency matrix M of size n × n, where n > 1. The intermediate matrices $W^{(k)}$,
where 0 ≤ k ≤ n − 1, need not be stored in memory until the end
of the algorithm’s operation; this is why, in the suggested implementation, the elements
of $W^{(k-1)}$ are overwritten by the elements of the subsequent matrix $W^{(k)}$.


Listing 1.5

    def Warshall_algorithm(M):
        n = len(M)

        W = [[0 for j in range(n)] \
             for i in range(n)]

        for i in range(n):
            for j in range(n):
                W[i][j] = M[i][j]

        for k in range(n):
            for i in range(n):
                for j in range(n):
                    W[i][j] = W[i][j] or \
                              (W[i][k] and W[k][j])

        return W


Correctness of the algorithm Warshall_algorithm can be proved by the method of
mathematical induction (see the description of this method below on page 53) [61].
The problem of finding M* is also investigated in Problem 1.39 and
in [61].


Example 1.17 Let digraph D be specified (Fig. 1.4). Construct the reachability 
matrix M*, using Warshall algorithm. 
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Fig. 1.4 Directed graph D 


Solution The matrix $W^{(0)}$ coincides with the adjacency matrix of the digraph and
has the form

\[
W^{(0)} = \begin{array}{c|cccc}
  & a & b & c & d \\ \hline
a & 0 & 1 & 0 & 0 \\
b & 1 & 0 & 1 & 0 \\
c & 0 & 0 & 0 & 0 \\
d & 1 & 0 & 1 & 0
\end{array}
\]

Calculate $W^{(1)}$. If $W^{(0)}_{ij} = 1$, then the respective element $W^{(1)}_{ij}$ is also equal to
1. If $W^{(0)}_{ij} = 0$, then attention should be paid to the elements of the first
row and the first column standing at the intersection with the j-th column and the
i-th row: if $W^{(0)}_{i1} = W^{(0)}_{1j} = 1$, then $W^{(1)}_{ij} = 1$. The condition $W^{(0)}_{i1} = W^{(0)}_{1j} = 1$
is fulfilled for two pairs (i, j), namely for i = j = 2 and i = 4, j = 2.
Then $W^{(1)}_{22} = W^{(1)}_{42} = 1$, and all the remaining elements $W^{(1)}_{ij}$ coincide with the respective
elements of the matrix $W^{(0)}$. For illustration, the elements that have changed values at a
given step are highlighted in bold and underlined:

\[
W^{(1)} = \begin{array}{c|cccc}
  & a & b & c & d \\ \hline
a & 0 & 1 & 0 & 0 \\
b & 1 & \underline{\mathbf{1}} & 1 & 0 \\
c & 0 & 0 & 0 & 0 \\
d & 1 & \underline{\mathbf{1}} & 1 & 0
\end{array}
\]

Then we will calculate $W^{(2)}$. Consider the second row and the second column
of the matrix $W^{(1)}$. The elements $W^{(2)}_{ij}$ that are positioned in the same row with the
elements $W^{(1)}_{i2} = 1$ from the second column and in the same column with the
elements $W^{(1)}_{2j} = 1$ from the second row will change their values in $W^{(2)}$ to 1.
Such will be the elements $W^{(2)}_{11}$ and $W^{(2)}_{13}$. The rest of the elements of $W^{(2)}$ coincide
with the respective elements of the matrix $W^{(1)}$.

\[
W^{(2)} = \begin{array}{c|cccc}
  & a & b & c & d \\ \hline
a & \underline{\mathbf{1}} & 1 & \underline{\mathbf{1}} & 0 \\
b & 1 & 1 & 1 & 0 \\
c & 0 & 0 & 0 & 0 \\
d & 1 & 1 & 1 & 0
\end{array}
\]

At the next step, the vertex c is added to the set of vertices. This does not result
in the appearance of new elements with value 1.

\[
W^{(3)} = \begin{array}{c|cccc}
  & a & b & c & d \\ \hline
a & 1 & 1 & 1 & 0 \\
b & 1 & 1 & 1 & 0 \\
c & 0 & 0 & 0 & 0 \\
d & 1 & 1 & 1 & 0
\end{array}
\]

At the final step, we obtain $W^{(4)} = W^{(3)}$, and the reachability matrix of the digraph
D will have the form

\[
M^* = \begin{array}{c|cccc}
  & a & b & c & d \\ \hline
a & 1 & 1 & 1 & 0 \\
b & 1 & 1 & 1 & 0 \\
c & 0 & 0 & 0 & 0 \\
d & 1 & 1 & 1 & 0
\end{array}
\]

Review Questions 


1. Define diagonal matrix, upper triangular matrix, lower triangular matrix,
symmetric matrix and binary matrix.
2. How is the matrix transposition operation performed?
3. How is the Kronecker delta determined?
4. What matrices are called symmetric? antisymmetric?
5. Formulate the definition of the product of two matrices of size m × n and n × p.
6. What is the commutator of the matrices A and B?
7. Define the trace of a square matrix.
8. What is an algorithm?
9. Enumerate the properties of an algorithm.
10. How is the algorithm efficiency estimated?
11. Explain the meaning of the notation O(f(n)).
12. Describe the presentation of matrices in Python.
13. Enumerate the basic matrix functions and procedures of the NumPy library.
14. How are graphs presented in the computer memory?
15. For solution of what problem is the Warshall algorithm used?


Problems 


1.1. Calculate 3A + 2B, where
\[ A = \begin{bmatrix} 2 & 1 & -1 \\ 0 & 1 & 4 \end{bmatrix}, \quad B = \begin{bmatrix} -2 & 1 & 0 \\ -3 & 2 & 2 \end{bmatrix}. \]

1.2. Calculate AB, where
\[ A = \begin{bmatrix} 1 & -1 & 0 \\ 2 & 3 & 4 \end{bmatrix}, \quad B = \begin{bmatrix} 1 & 1 \\ 2 & -1 \\ 3 & 0 \end{bmatrix}. \]
Find $B^T A^T$ and $(AB)^T$.

1.3. Let the matrix $A = (a_{ij})$ be of size $n_1 \times n_2$ and the matrix $B = (b_{ij})$ be of
size $n_2 \times n_3$. Prove that the following equality is fulfilled:

\[ (AB)^T = B^T A^T, \tag{1.38} \]

i.e. the transposed product of two matrices is equal to the product of the
transposed matrices in reversed order.

1.4. Write the matrices of size 3 × 3 whose elements are determined by the
formulas:

(1) $a_{ij} = (-1)^{i+j+1}$;
(2) $b_{ij} = \dfrac{i + j + |i - j|}{2}$;
(3) $c_{ij} = (i - 2)^2 + (j - 2)^2$;
(4) $d_{ij} = \sin(|i - j|)$.

Calculate the sum of all elements S of each matrix.

1.5. Let $A = (a_{ij})$ be a square matrix of order n > 3. Using the summation
symbol Σ, write the following values:


(1) the sum of the elements of the third row; 

(2) the sum of the elements of the second column; 

(3) the sum of squares of the diagonal elements; 

(4) the absolute value of the sum of the elements positioned on the secondary
diagonal.


1.6. How to write the sum of the elements of the square matrix positioned above 
the main diagonal using the summation sign? How to do this for the elements 
positioned below the main diagonal? 

1.7. A student carrying out an experiment in a chemical laboratory has accidentally
spilled a reagent on a page of algebra notes where an antisymmetric matrix
was written. As a result, it was impossible to read some of its elements. If we
denote such elements by the symbol “?”, the notation will look as

\[ \begin{bmatrix} 0 & 1 & -1 & ? \\ ? & 0 & 2 & 2 \\ ? & ? & ? & ? \\ 7 & ? & 0 & ? \end{bmatrix} \]

Restore the unknown elements and write the original matrix.
1.8. Determine the number of binary matrices of n rows and n columns that are 


(1) symmetric and 
(2) antisymmetric 


relative to the main diagonal. 
1.9. Calculate 


001 
1-1 3 
112 4 1 —2 
a) 2 ye 2B) 
2.23 1 3 —4 
1 1 
334 
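1.10. Consider the binary matrices
\[ Q_1 = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad Q_2 = \begin{bmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix}. \]

Calculate $Q_1 Q_2$, $Q_2 Q_1$, $Q_1^2$ and $Q_2^2$.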


1.11. For what matrices D of the second order is the square $D^2$ equal to the zero
matrix?

1.12. Let the matrices L = [−2, −1, 0, 1, 2], M = [0, 2, 4, 6, 8] be specified.
Calculate the products $LM^T$ and $M^T L$.

1.13. The elements of the matrix $G = (g_{ij})$ are determined in accordance with the
rule
\[ g_{ij} = \begin{cases} 1, & \text{if } i \ge j, \\ 0, & \text{if } i < j. \end{cases} \]
What are the elements of the matrix $G^2$ equal to?

1.14. At the examination in linear algebra, a student says that multiplication of two
non-zero matrices will necessarily result in a non-zero matrix. Is the student
right?

*1.15. Let us denote by $x_i$ the number of processors produced by some plant starting
from the beginning of the year. In particular, the plant’s operation during the
first month is described by the vector $X = [x_1, x_2, \ldots, x_{31}]^T$. Determine
the matrix D which should act on X in order to obtain the vector
$Y = [x_2 - x_1, x_3 - x_2, \ldots, x_n - x_{n-1}]^T$: Y = D · X. The vector Y reflects
the daily production capacity gain of the plant.

1.16. Calculate the product of the functional matrices A(ψ)B(θ)A(φ), if
\[ A(\varphi) = \begin{bmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad B(\theta) = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{bmatrix}. \]

1.17. Calculate the commutator [A, B], if
\[ A = \begin{bmatrix} 7 & 5 & 3 \\ 1 & 3 & 2 \\ 2 & 2 & 7 \end{bmatrix}, \quad B = \begin{bmatrix} 6 & 2 & 3 \\ 5 & 2 & 1 \\ 1 & 1 & 6 \end{bmatrix}. \]

1.18. Calculate the commutator [A, B], if A is an arbitrary matrix and B = I is the
identity matrix of the same order as A.

1.19. Consider the matrices
\[ P_1 = \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}, \quad P_2 = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{bmatrix}, \quad P_3 = \begin{bmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{bmatrix}. \tag{1.39} \]
Calculate the commutators $[P_1, P_2]$, $[P_2, P_3]$ and $[P_3, P_1]$.

1.20. Is it true that for any square matrices A, B and C of the same size the equality
[A + B, C] = [A, C] + [B, C] is fulfilled?
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1.21. 


Prove that for any matrices A, B and C of the same size, the following identity is true:

\[ [AB, C] = A[B, C] + [A, C]B. \tag{1.40} \]

1.22. Is it true that for any square matrices A, B and C of the same size the equality
[A, [B, C]] = [[A, B], C] is fulfilled?

1.23. Let square matrices A and B of the same order be given. In what case is the
equality $(A + B)^2 = A^2 + 2AB + B^2$ true?

1.24. Is it true that if [A, B] = O and [A, C] = O, then the matrices B and C are
commuting?

1.25. Suppose that the sizes of the matrices A, B and C are equal to $n_1 \times n_2$,
$n_2 \times n_3$ and $n_3 \times n_4$, respectively. In order to calculate the product ABC,
the multiplication operations can be executed in two ways: (A · B) · C or
A · (B · C). Under what relation between the variables $n_1$, $n_2$, $n_3$ and $n_4$ will the
calculation using the first method, as (A · B) · C, require fewer operations
of multiplication of real numbers than the second method?

1.26. Prove the correctness of the algorithm that exchanges the values of two variables
without using an auxiliary variable.

Listing 1.6

    # Exchanging of values of variables a and b
    # without using the auxiliary variable
    a = a + b
    b = a - b
    a = a - b

1.27. Prove the correctness of the algorithm of addition of square matrices.

Listing 1.7

    # Addition of matrices A and B
    def matrix_add(A, B):
        if len(A) == len(B) and \
                len(A[0]) == len(B[0]):
            C = [[0 for j in range(len(A[0]))] \
                 for i in range(len(A))]

            for i in range(len(A)):
                for j in range(len(A[0])):
                    C[i][j] = A[i][j] + B[i][j]

            return C

1.28. Prove the correctness of the square matrix multiplication algorithm.

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Listing 1.8 

    # Multiplication of matrices A and B
    def matrix_mult(A, B):
        C = [[0 for j in range(len(B[0]))] \
             for i in range(len(A))]

        for i in range(len(A)):
            for j in range(len(B[0])):
                s = 0

                for k in range(len(B)):
                    s += A[i][k] * B[k][j]

                C[i][j] = s

        return C


1.29. Prove the correctness of the optimized matrix multiplication algorithm.

Listing 1.9

    # Optimized multiplication
    # of matrices A and B
    def matrix_mult2(A, B):
        n = len(A)

        C = [[0 for i in range(n)] \
             for j in range(n)]

        D = [0 for i in range(n)]

        for i in range(n):
            for j in range(n):
                s = 0

                for k in range(n):
                    D[k] = B[k][j]

                for k in range(n):
                    s += A[i][k] * D[k]

                C[i][j] = s

        return C

Note The considered variant of matrix multiplication is optimized by
first copying the elements of the column B[k][j] into the
intermediate array D, which may fit entirely in fast cache memory.

1.30. Determine the number of operations of addition of two numbers executed by
the algorithm matrix_add for matrices of size N × N.

1.31. Determine the number of operations of addition and multiplication executed
by the algorithm matrix_mult for matrices of size N × N.

1.32. Suggest a method to decrease the number of addition operations executed by
the algorithm matrix_mult.

1.33. Let $A_1$, $A_2$ and $A_3$ be numerical matrices of size 50 × 25, 25 × 30 and 30 × 10,
respectively. Determine the minimal number of multiplication operations
required to calculate the product $A_1 A_2 A_3$ by the standard algorithm
matrix_mult, whose implementation for square matrices is presented in
Problem 1.28.

1.34. Let $A_1$, $A_2$, $A_3$ and $A_4$ be numerical matrices of size 25 × 10, 10 × 50,
50 × 5 and 5 × 30, respectively. Find the minimal number of multiplication
operations required for calculation of the product $A_1 A_2 A_3 A_4$ by the
standard algorithm matrix_mult.

1.35. Let $A_1$, $A_2$, $A_3$ and $A_4$ be numerical matrices of size 100 × 20, 20 × 15,
15 × 50 and 50 × 100, respectively. Find the minimal number of multiplication
operations required for calculation of the product $A_1 A_2 A_3 A_4$ by the
standard algorithm matrix_mult.

*1.36. Prove that the number of methods for calculation of the product of the
matrices $A_1 A_2 \ldots A_{m+1}$, m > 1, or, in other words, the number of ways
to place brackets in this product, where $A_1, A_2, \ldots, A_{m+1}$ are numerical
matrices of size $n_1 \times n_2, n_2 \times n_3, \ldots, n_{m+1} \times n_{m+2}$, respectively, is equal to
the Catalan⁸ number $C_m$, determined by the formula

\[ C_m = \frac{1}{m+1}\, C(2m, m) \quad \text{for all } m > 1, \tag{1.41} \]

where C(2m, m) is a binomial coefficient.

1.37. Estimate the number of operations executed by the Warshall algorithm for
obtaining the reachability matrix of a digraph.

1.38. Using the Warshall algorithm, calculate the reachability matrix of the digraph
D presented in Fig. 1.5.

1.39. One of the ways to modify the Warshall algorithm consists in presenting
the rows of binary matrices as bit strings. In this case, for calculation of
the elements of the matrices $W^{(i)}$ for 1 ≤ i ≤ N, the bitwise operation or
is used. Find the number of operations on bit strings executed by the given
implementation of the Warshall algorithm.


8Eugéne Charles Catalan (1814-1894), Belgian mathematician. 
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Fig. 1.5 To Problem 1.38 3 


1.40. Write the analytical expression for the function f(n), represented by the 
algorithm: 


Listing 1.10

    def f(n: int):
        temp = 0

        for i in range(1, n + 1):
            for j in range(1, n + 1):
                for k in range(j, n + 1):
                    temp += 1

        return temp


*1.41. Write the analytical expression for the function g(n), represented by the
algorithm:
Listing 1.11

    def g(n: int):
        temp = 0

        for i in range(1, n + 1):
            for j in range(n, i -
                for k in range(1,
                    temp += 1

        return temp
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*1.42. Write the analytical expression for the function h(n), represented by the 
algorithm: 
Listing 1.12

    def h(n: int):
        temp = 0

        for i in range(1, n + 1):
            for j in range(1, i + 1):
                for k in range(1, j + 1):
                    for l in range(1, k + 1):
                        temp += 1

        return temp


1.43. Calculate the trace of the matrix A = (a;;), where 1 < i, j < n, whose 
elements are specified by the formulas: 


Cl) aij =i +f; 
(2) aij =i — js 
(3) ajj = In(i? + 7); 
(4) ajj = max(i,n — j); 
(5) aij = min | —, — }; 
ij 
(6) aij = sin(r(i + 2j)); 
n f ; 

(7) aij = ao G+ fj); 

1 

J 


1.44. Prove that the trace of the product of the square matrices does not depend on 
the order of multipliers: tr(AB) = tr (BA). 


Answers and Solutions 


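1.1 Solution.
We calculate the summands in the sum:
\[ 3A = \begin{bmatrix} 6 & 3 & -3 \\ 0 & 3 & 12 \end{bmatrix}, \quad 2B = \begin{bmatrix} -4 & 2 & 0 \\ -6 & 4 & 4 \end{bmatrix}. \]

We perform the matrix summation operation:
\[ 3A + 2B = \begin{bmatrix} 2 & 5 & -3 \\ -6 & 7 & 16 \end{bmatrix}. \]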
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1.2 Solution. 
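We will calculate the product AB relying on the formula (1.16) on page 7:

\[ AB = \begin{bmatrix} 1 & -1 & 0 \\ 2 & 3 & 4 \end{bmatrix} \begin{bmatrix} 1 & 1 \\ 2 & -1 \\ 3 & 0 \end{bmatrix} = \begin{bmatrix} 1 \cdot 1 + (-1) \cdot 2 + 0 \cdot 3 & 1 \cdot 1 + (-1) \cdot (-1) + 0 \cdot 0 \\ 2 \cdot 1 + 3 \cdot 2 + 4 \cdot 3 & 2 \cdot 1 + 3 \cdot (-1) + 4 \cdot 0 \end{bmatrix} = \begin{bmatrix} -1 & 2 \\ 20 & -1 \end{bmatrix}. \]

We execute the matrix transposition operation:

\[ A^T = \begin{bmatrix} 1 & 2 \\ -1 & 3 \\ 0 & 4 \end{bmatrix}, \quad B^T = \begin{bmatrix} 1 & 2 & 3 \\ 1 & -1 & 0 \end{bmatrix}, \]

\[ B^T A^T = \begin{bmatrix} 1 \cdot 1 + 2 \cdot (-1) + 3 \cdot 0 & 1 \cdot 2 + 2 \cdot 3 + 3 \cdot 4 \\ 1 \cdot 1 + (-1) \cdot (-1) + 0 \cdot 0 & 1 \cdot 2 + (-1) \cdot 3 + 0 \cdot 4 \end{bmatrix} = \begin{bmatrix} -1 & 20 \\ 2 & -1 \end{bmatrix}, \]

\[ (AB)^T = \begin{bmatrix} -1 & 2 \\ 20 & -1 \end{bmatrix}^T = \begin{bmatrix} -1 & 20 \\ 2 & -1 \end{bmatrix}. \]

Note that the equality $(AB)^T = B^T A^T$ is fulfilled for any matrices A and B for
which the product AB is defined.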


1.3 Proof. 
Based on the definition of the matrix transposition and on the formula (1.16), the
left part of the equality (1.38) consists of the elements

\[ \bigl( (AB)^T \bigr)_{ij} = (AB)_{ji} = \sum_{k=1}^{n_2} a_{jk} b_{ki}, \]

where $1 \le i \le n_3$, $1 \le j \le n_1$.
Further, we represent the elements of the right side of Eq. (1.38) in the form

\[ (B^T A^T)_{ij} = \sum_{k=1}^{n_2} (B^T)_{ik} (A^T)_{kj} = \sum_{k=1}^{n_2} b_{ki} a_{jk} = \sum_{k=1}^{n_2} a_{jk} b_{ki}, \]

where $1 \le i \le n_3$, $1 \le j \le n_1$.
It is proved that for all possible numbers i and j the elements of the matrices $(AB)^T$ and
$B^T A^T$ coincide. In accordance with the definition of equality of matrices on page 4
we get

\[ (AB)^T = B^T A^T. \]


1.4 Solution. 
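Having calculated the matrix elements by the specified formulas, we obtain

(1) $\begin{bmatrix} -1 & 1 & -1 \\ 1 & -1 & 1 \\ -1 & 1 & -1 \end{bmatrix}$, the sum of all elements S = −1;

(2) $\begin{bmatrix} 1 & 2 & 3 \\ 2 & 2 & 3 \\ 3 & 3 & 3 \end{bmatrix}$, S = 22;

(3) $\begin{bmatrix} 2 & 1 & 2 \\ 1 & 0 & 1 \\ 2 & 1 & 2 \end{bmatrix}$, S = 12;

(4) $\begin{bmatrix} 0 & \sin 1 & \sin 2 \\ \sin 1 & 0 & \sin 1 \\ \sin 2 & \sin 1 & 0 \end{bmatrix}$, S = 4 sin 1 + 2 sin 2.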


1.5 Solution. 


(1) For the elements of the third row we have i = 3, j = 1, 2, …, n. Therefore,
the sum of the elements of the third row is presented in the form $\sum_{j=1}^{n} a_{3j}$.

(2) The elements of the second column may be written as $a_{i2}$, where
i = 1, 2, …, n. Then, their sum is equal to $\sum_{i=1}^{n} a_{i2}$.

(3) For the diagonal elements, the indices i and j coincide: $a_{ii}$. The sum of squares
of such elements is $\sum_{i=1}^{n} a_{ii}^2$.

(4) The secondary diagonal is formed by the elements whose sum of indices is
greater by one than the order of the matrix: i + j = n + 1. Therefore, j = (n + 1) − i, and
the absolute value of the sum of these elements is equal to $\left| \sum_{i=1}^{n} a_{i,\,n+1-i} \right|$.

1.6 Solution.


Let us write the sum of the elements of the square matrix positioned above the
main diagonal. The number of the elements to be summed up in the rows above
the main diagonal decreases by one as the row number increases. Due to this,
summation of the elements in the i-th row begins with j = i + 1, so the sum in one
row will be $\sum_{j=i+1}^{n} a_{ij}$. Performing the summation operation over all rows, we obtain
$\sum_{i=1}^{n} \sum_{j=i+1}^{n} a_{ij}$.


Now let us write the sum of the elements positioned below the main diagonal.
The number of the elements to be summed up below the main diagonal increases by
one as the row number increases. Then, the upper summation limit should be
equal to i − 1, and the sum for one row is calculated as $\sum_{j=1}^{i-1} a_{ij}$. For the sum of the
elements positioned below the main diagonal, we obtain $\sum_{i=1}^{n} \sum_{j=1}^{i-1} a_{ij}$.


1.7 Solution. 


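By definition of an antisymmetric matrix, $A^T = -A$, then

\[ A^T = \begin{bmatrix} 0 & a_{21} & a_{31} & 7 \\ 1 & 0 & a_{32} & a_{42} \\ -1 & 2 & a_{33} & 0 \\ a_{14} & 2 & a_{34} & a_{44} \end{bmatrix} = \begin{bmatrix} 0 & -1 & 1 & -a_{14} \\ -a_{21} & 0 & -2 & -2 \\ -a_{31} & -a_{32} & -a_{33} & -a_{34} \\ -7 & -a_{42} & 0 & -a_{44} \end{bmatrix}. \]

Equating the respective matrix elements, we obtain

\[ A = \begin{bmatrix} 0 & 1 & -1 & -7 \\ -1 & 0 & 2 & 2 \\ 1 & -2 & 0 & 0 \\ 7 & -2 & 0 & 0 \end{bmatrix}. \]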

1.8 Solution. 
First of all we should note that the number of the elements on the main diagonal
of the matrix is n, while the number of the elements lying above it is n(n − 1)/2.


(1) The elements of the symmetric matrix lying below the main diagonal are
uniquely determined by the upper triangular part of the matrix; the latter can be
chosen in $2^{n(n-1)/2}$ ways. There exist $2^n$ ways to select the diagonal
elements. We obtain that the number of symmetric matrices of n rows and n
columns is equal to $2^{n(n-1)/2} \cdot 2^n = 2^{n(n+1)/2}$.

(2) The main diagonal of the antisymmetric matrix is filled with zeros. In order to
determine such a matrix, it is sufficient to specify the elements above the main
diagonal; this can be done in $2^{n(n-1)/2}$ ways. Therefore, all in all there exist
$2^{n(n-1)/2}$ antisymmetric matrices of size n × n.
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1.9 Solution. 


a) 


w 
bw 
K 
bw 
K 
| 
\o 
— 
oO 
w 
| 
& 


1.10 Solution. 


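\[ Q_1 Q_2 = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{bmatrix}, \]

\[ Q_2 Q_1 = \begin{bmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix} \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix} = \begin{bmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{bmatrix}, \]

\[ Q_1^2 = Q_2^2 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}. \]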


1.11 Solution.

Let the matrix D have the form $\begin{bmatrix} a & b \\ c & d \end{bmatrix}$, where a, b, c, d are unknown real
numbers.
Let us square D and equate the obtained result to the zero matrix:

\[ \begin{bmatrix} a & b \\ c & d \end{bmatrix} \begin{bmatrix} a & b \\ c & d \end{bmatrix} = \begin{bmatrix} a^2 + bc & ab + bd \\ ac + dc & bc + d^2 \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}. \]

Let us write the system with respect to the unknowns a, b, c, d:

\[ a^2 + bc = 0, \quad c(a + d) = 0, \quad b(a + d) = 0, \quad bc + d^2 = 0. \]

From the obtained system it follows that the matrix D should have the form
$\begin{bmatrix} a & b \\ c & -a \end{bmatrix}$, and the variables a, b and c are bound by the condition $a^2 + bc = 0$.


1.12 Answer: 
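The size of the matrix L is equal to 1 × 5, the size of the matrix $M^T$ is equal to
5 × 1. Therefore, the matrices $LM^T$ and $M^T L$ have the sizes 1 × 1 (this is a scalar)
and 5 × 5, respectively. Having executed the multiplication operations, we obtain

\[ LM^T = [-2, -1, 0, 1, 2] \begin{bmatrix} 0 \\ 2 \\ 4 \\ 6 \\ 8 \end{bmatrix} = (-2) \cdot 0 + (-1) \cdot 2 + 0 \cdot 4 + 1 \cdot 6 + 2 \cdot 8 = 20; \]

\[ M^T L = \begin{bmatrix} 0 \\ 2 \\ 4 \\ 6 \\ 8 \end{bmatrix} [-2, -1, 0, 1, 2] = \begin{bmatrix} 0 & 0 & 0 & 0 & 0 \\ -4 & -2 & 0 & 2 & 4 \\ -8 & -4 & 0 & 4 & 8 \\ -12 & -6 & 0 & 6 & 12 \\ -16 & -8 & 0 & 8 & 16 \end{bmatrix}. \]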
1.13 Answer:
Let $H = G^2$, then
\[ h_{ij} = \begin{cases} i - j + 1, & \text{if } i \ge j, \\ 0, & \text{if } i < j. \end{cases} \]
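
This answer can be checked for a particular order. The sketch below assumes that G is the lower triangular matrix of ones (1 on and below the main diagonal), which is the reading under which the formula i − j + 1 holds; the helper names lower_ones and square are assumptions made for the illustration.

```python
def lower_ones(n):
    """n x n matrix with 1 on and below the main diagonal, 0 above it."""
    return [[1 if i >= j else 0 for j in range(n)] for i in range(n)]


def square(G):
    """Square of a square matrix by the definition of the matrix product."""
    n = len(G)
    return [[sum(G[i][k] * G[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


n = 4
H = square(lower_ones(n))
# The difference i - j is the same for 0-based and 1-based indices
expected = [[i - j + 1 if i >= j else 0 for j in range(n)] for i in range(n)]
print(H == expected)  # True
```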
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1.14 Solution. 


Let $A = \begin{bmatrix} t & 0 \\ 0 & 0 \end{bmatrix}$, $B = \begin{bmatrix} 0 & 0 \\ t & 0 \end{bmatrix}$, where t ∈ R, t ≠ 0.

As is easy to see, the equality AB = O is fulfilled here; therefore, the product of
two non-zero matrices may be equal to a zero matrix. The student is wrong.


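1.16 Solution.

\[ A(\psi)B(\theta) = \begin{bmatrix} \cos\psi & \sin\psi & 0 \\ -\sin\psi & \cos\psi & 0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\theta & \sin\theta \\ 0 & -\sin\theta & \cos\theta \end{bmatrix} = \begin{bmatrix} \cos\psi & \sin\psi\cos\theta & \sin\psi\sin\theta \\ -\sin\psi & \cos\psi\cos\theta & \cos\psi\sin\theta \\ 0 & -\sin\theta & \cos\theta \end{bmatrix}, \]

\[ A(\psi)B(\theta)A(\varphi) = \begin{bmatrix} \cos\psi & \sin\psi\cos\theta & \sin\psi\sin\theta \\ -\sin\psi & \cos\psi\cos\theta & \cos\psi\sin\theta \\ 0 & -\sin\theta & \cos\theta \end{bmatrix} \begin{bmatrix} \cos\varphi & \sin\varphi & 0 \\ -\sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1 \end{bmatrix} \]

\[ = \begin{bmatrix} \cos\psi\cos\varphi - \sin\psi\cos\theta\sin\varphi & \cos\psi\sin\varphi + \sin\psi\cos\theta\cos\varphi & \sin\psi\sin\theta \\ -\sin\psi\cos\varphi - \cos\psi\cos\theta\sin\varphi & -\sin\psi\sin\varphi + \cos\psi\cos\theta\cos\varphi & \cos\psi\sin\theta \\ \sin\theta\sin\varphi & -\sin\theta\cos\varphi & \cos\theta \end{bmatrix}. \]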
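1.17 Solution.

\[ AB = \begin{bmatrix} 7 & 5 & 3 \\ 1 & 3 & 2 \\ 2 & 2 & 7 \end{bmatrix} \begin{bmatrix} 6 & 2 & 3 \\ 5 & 2 & 1 \\ 1 & 1 & 6 \end{bmatrix} = \begin{bmatrix} 70 & 27 & 44 \\ 23 & 10 & 18 \\ 29 & 15 & 50 \end{bmatrix}, \]

\[ BA = \begin{bmatrix} 6 & 2 & 3 \\ 5 & 2 & 1 \\ 1 & 1 & 6 \end{bmatrix} \begin{bmatrix} 7 & 5 & 3 \\ 1 & 3 & 2 \\ 2 & 2 & 7 \end{bmatrix} = \begin{bmatrix} 50 & 42 & 43 \\ 39 & 33 & 26 \\ 20 & 20 & 47 \end{bmatrix}, \]

\[ [A, B] = AB - BA = \begin{bmatrix} 20 & -15 & 1 \\ -16 & -23 & -8 \\ 9 & -5 & 3 \end{bmatrix}. \]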
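
The arithmetic of this solution is easy to verify with a few lines of code. The sketch below defines a commutator for matrices stored as lists of lists; the helper names matmul and commutator are assumptions made for the illustration.

```python
def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def commutator(A, B):
    """[A, B] = AB - BA for square matrices of the same order."""
    AB, BA = matmul(A, B), matmul(B, A)
    return [[x - y for x, y in zip(row_ab, row_ba)] for row_ab, row_ba in zip(AB, BA)]


A = [[7, 5, 3], [1, 3, 2], [2, 2, 7]]
B = [[6, 2, 3], [5, 2, 1], [1, 1, 6]]
print(commutator(A, B))  # [[20, -15, 1], [-16, -23, -8], [9, -5, 3]]
```

The same function returns the zero matrix when B is the identity matrix, in agreement with the answer to Problem 1.18.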


1.18 Answer: [A, I] = O. 
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1.19 Answer: 
$[P_1, P_2] = P_3$, $[P_2, P_3] = P_1$ and $[P_3, P_1] = P_2$.


1.20 Solution. 
Yes, the equality [A + B,C] = [A, C] + [B, C] is true for any square matrices 
A, B and C of the same size. Simple calculations show that 


[A + B, C] = (A + B)C − C(A + B) = AC − CA + BC − CB = [A, C] + [B, C].


1.21 Proof. 
Transform the right side of the equality (1.40), relying on the definition (1.20):

$$A[B, C] + [A, C]B = A(BC - CB) + (AC - CA)B.$$

Then expand the brackets, cancel the like terms and use the definition of the commutator once again:

$$A[B, C] + [A, C]B = ABC - ACB + ACB - CAB = ABC - CAB = [AB, C].$$

Thus the identity (1.40) is proved.


1.22 Solution. 
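No, the equality $[A, [B, C]] = [[A, B], C]$ does not hold for all $A$, $B$ and $C$, as the following counterexample shows:

$$A = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}, \quad B = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \quad C = \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix},$$

$$[A, [B, C]] = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}, \quad [[A, B], C] = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}.$$

Therefore, in general, $[A, [B, C]] \neq [[A, B], C]$.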


1.23 Answer: the equality is true in the case AB = BA, i.e. if the matrices are 
commuting. 


1.24 Solution. 
This is not true. The following counterexample can be given to the proposition 
from the problem situation: 


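$$A = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \quad B = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}, \quad C = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}.$$

In this case, the equalities $[A, B] = [A, C] = O$ are fulfilled, while

$$[B, C] = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} - \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} = \begin{bmatrix} 0 & 2 \\ -2 & 0 \end{bmatrix} \neq O.$$

Therefore, the matrices $B$ and $C$ do not necessarily commute.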


1.25 Solution. 
Calculate the number of multiplication operations for each of the two methods.

(1) Multiplying one row of the matrix $A$ by one column of the matrix $B$ takes $n_2$ multiplications. Since $A$ has $n_1$ rows and $B$ has $n_3$ columns, computing $AB$ takes $n_1 n_2 n_3$ operations. The matrix $AB$ has the size $n_1 \times n_3$; hence obtaining the matrix $ABC$ from $AB$ and $C$ takes another $n_1 n_3 n_4$ multiplications of real numbers. Thus, the calculation of $(AB)C$ requires $n_1 n_3 (n_2 + n_4)$ multiplication operations.

(2) Reasoning similarly, we obtain $n_2 n_4 (n_1 + n_3)$ multiplication operations required for calculation by the scheme $A(BC)$.

As a result, calculation by the first method, as $(AB)C$, requires fewer multiplications of real numbers when the condition $n_1 n_3 (n_2 + n_4) < n_2 n_4 (n_1 + n_3)$ is fulfilled.
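The two operation counts can be compared directly. The following is a minimal sketch; the function names are illustrative and not taken from the textbook, and the sizes are assumed to be $n_1 \times n_2$, $n_2 \times n_3$ and $n_3 \times n_4$ for $A$, $B$ and $C$.

```python
def mults_left(n1, n2, n3, n4):
    """Scalar multiplications needed for (AB)C."""
    return n1 * n2 * n3 + n1 * n3 * n4


def mults_right(n1, n2, n3, n4):
    """Scalar multiplications needed for A(BC)."""
    return n2 * n3 * n4 + n1 * n2 * n4


def cheaper_order(n1, n2, n3, n4):
    """Name the bracketing that needs fewer scalar multiplications."""
    left = mults_left(n1, n2, n3, n4)
    right = mults_right(n1, n2, n3, n4)
    if left < right:
        return "(AB)C"
    if right < left:
        return "A(BC)"
    return "either"


if __name__ == "__main__":
    # Sizes 50 x 25, 25 x 30 and 30 x 10.
    print(mults_left(50, 25, 30, 10), mults_right(50, 25, 30, 10))
    print(cheaper_order(50, 25, 30, 10))
```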


1.30 Answer: calculating the sum of two matrices $A$ and $B$ requires $N^2$ additions, one for each of the $N^2$ elements of $A + B$.


1.31 Solution.

Each of the $N^2$ elements of the matrix $AB$ is calculated as a scalar product of two vectors of size $N$, which requires $N$ additions and $N$ multiplications. The total number of both additions and multiplications is therefore equal to $N \cdot N^2 = N^3$.


1.32 Solution. 
The body of the loop over the variable k may be rewritten in the form

    C[i][j] = A[i][0] * B[0][j]
    for k in range(1, len(B)):
        C[i][j] = C[i][j] + A[i][k] * B[k][j]

Then the number of additions reduces to $N^3 - N^2$, while the number of multiplications remains unchanged.
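The claim about the operation counts can be checked by instrumenting the rewritten loop. This is only a sketch: the function name `multiply_counting` and the counters are introduced here for illustration and are not part of the problem statement.

```python
def multiply_counting(A, B):
    """Multiply A by B with the rewritten inner loop and count operations."""
    n_rows, n_inner, n_cols = len(A), len(B), len(B[0])
    C = [[0] * n_cols for _ in range(n_rows)]
    additions = 0
    multiplications = 0
    for i in range(n_rows):
        for j in range(n_cols):
            # The first product initializes the element, so no addition is needed.
            C[i][j] = A[i][0] * B[0][j]
            multiplications += 1
            for k in range(1, n_inner):
                C[i][j] = C[i][j] + A[i][k] * B[k][j]
                additions += 1
                multiplications += 1
    return C, additions, multiplications


if __name__ == "__main__":
    A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    product, adds, mults = multiply_counting(A, identity)
    print(product, adds, mults)
```

For square matrices of size $N$ the counters come out as $N^3 - N^2$ additions and $N^3$ multiplications.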


1.33 Solution. 

As is known, the number of multiplication operations required to calculate the product of matrices of size $n_1 \times n_2$ and $n_2 \times n_3$ equals $n_1 n_2 n_3$. (Recall that the product of two matrices is defined if the number of columns of the first one coincides with the number of rows of the second one.) Due to associativity of the multiplication operation, the product $A_1 A_2 A_3$ can be calculated in two ways: $(A_1 A_2)A_3$ and $A_1(A_2 A_3)$.

In the first case, we will need $50 \cdot 25 \cdot 30 + 50 \cdot 30 \cdot 10 = 52\,500$ multiplications, and in the second case $25 \cdot 30 \cdot 10 + 50 \cdot 25 \cdot 10 = 20\,000$. So, the minimal number of multiplication operations required to calculate the elements of the matrix $A_1 A_2 A_3$ by the standard algorithm equals 20 000.


Note There exists an efficient algorithm for finding the order of multiplications in the product $A_1 A_2 \dots A_n$, $n > 2$, with the minimal number of operations [16].
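One well-known way to find such an order is dynamic programming over sub-chains. The sketch below shows that classical scheme; whether it coincides with the algorithm described in [16] is an assumption, and the function name and the `dims` convention (matrix number $i$ has size `dims[i-1]` by `dims[i]`) are chosen here for illustration.

```python
def min_chain_multiplications(dims):
    """Minimal number of scalar multiplications for a matrix chain.

    dims has one more entry than there are matrices:
    the i-th matrix (1-based) has size dims[i-1] x dims[i].
    """
    n = len(dims) - 1  # number of matrices in the chain
    # cost[i][j] is the minimal cost of multiplying matrices i..j (0-based).
    cost = [[0] * n for _ in range(n)]
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            cost[i][j] = min(
                cost[i][k] + cost[k + 1][j] + dims[i] * dims[k + 1] * dims[j + 1]
                for k in range(i, j)
            )
    return cost[0][n - 1]


if __name__ == "__main__":
    # Sizes 50 x 25, 25 x 30 and 30 x 10, as in Solution 1.33.
    print(min_chain_multiplications([50, 25, 30, 10]))
```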


1.34 Answer: 7500. 
1.35 Answer: 255,000. 


1.37 Solution. 

In order to calculate $W[i, j]$ in lines 14 and 15 of the algorithm (see page 21), two logical operations are required. Since these lines are executed $N \times N \times N = N^3$ times, where $N$ is the size of the adjacency matrix of the digraph, the total number of operations for obtaining $M^*$ equals $2N^3$.


1.38 Answer: 


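$$M^* = \begin{bmatrix} 0 & 1 & 1 & 1 & 1 \\ 0 & 1 & 1 & 1 & 0 \\ 0 & 1 & 1 & 1 & 0 \\ 0 & 1 & 1 & 1 & 0 \\ 0 & 1 & 1 & 1 & 0 \end{bmatrix}$$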
N?(N +1 
1.40 Answer: f(N) = te 
N+1)2 1 
LAL Answer: g(N) = ~SFUENTD) 
N+1 2)(N 
1.42 Answer: h(N) = —— 


1.43 Answer: 


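(1) $\operatorname{tr} A = n(n+1)$;
(2) $\operatorname{tr} A = 0$;
(3) $\operatorname{tr} A = n \ln 2 + 2\ln(n!)$, where $n! = \prod_{i=1}^{n} i$ is the factorial of the number $n$;
(4) $\operatorname{tr} A = \begin{cases} 3k^2, & \text{if } n = 2k, \\ 3k^2 + 3k + 1, & \text{if } n = 2k + 1, \end{cases}$ where $k \in \mathbb{N}$;
(5) $\operatorname{tr} A = H_n$, where $H_n = \sum_{i=1}^{n} \frac{1}{i}$ is the n-th harmonic number;
(6) $\operatorname{tr} A = 0$;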
1
(7) tA = —nkn + 2); 
1 
(8) tA = qn + 1)+ Hy, where H,, is harmonic number (see above). 


1.44 Proof. 
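The validity of the statement $\operatorname{tr}(AB) = \operatorname{tr}(BA)$ follows from the chain of equalities:

$$\operatorname{tr}(AB) = \sum_{i,j} a_{ij} b_{ji} = \sum_{i,j} b_{ji} a_{ij} = \sum_{i,j} b_{ij} a_{ji} = \operatorname{tr}(BA).$$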


Chapter 2
Matrix Algebra


2.1 Determinant of a Matrix: Determinants of the Second 
and Third Order 


One of the fundamental notions of linear algebra is the determinant of a square 
matrix. Let us begin considering this notion with the determinants of the second 
and third order. 


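We will associate the matrix $A = \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix}$ of size $2 \times 2$ with the number

$$a_{11}a_{22} - a_{21}a_{12}, \tag{2.1}$$

which is called the second order determinant of the matrix $A$ and is denoted as

$$\det A = \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}. \tag{2.2}$$

Example 2.1 If $A = \begin{bmatrix} 3 & 2 \\ 1 & 5 \end{bmatrix}$, then $\det A = \begin{vmatrix} 3 & 2 \\ 1 & 5 \end{vmatrix} = 3 \cdot 5 - 1 \cdot 2 = 13$.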
15 15 
The third order determinant of the matrix $A$ of size $3 \times 3$ is the number obtained by the following formula:

$$\det A = |A| = \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} = a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32} - a_{13}a_{22}a_{31} - a_{12}a_{21}a_{33} - a_{11}a_{23}a_{32}. \tag{2.3}$$


The formula (2.3) can be easily remembered with the help of the triangle rule: the value of the third order determinant is equal to the algebraic sum of six terms, each being a product of three elements, one from each row and each column of the matrix $A$. The sign “+” is taken by the product of the elements lying on the main diagonal and by the two products of elements that form triangles with bases parallel to the main diagonal. The sign “−” is taken by the product of the elements lying on the secondary diagonal and by the two products of elements that form triangles with bases parallel to the secondary diagonal (see Fig. 2.1). Unfortunately, the triangle rule is only applicable to determinants of matrices of size $3 \times 3$. Sect. 2.2 describes what to do with matrices of greater size.


Fig. 2.1 The scheme “triangle rule” for calculation of the third order determinant (two panels: sign “+” and sign “−”)
Example 2.2

$$\det A = |A| = \begin{vmatrix} 4 & 2 & 2 \\ -4 & -3 & 1 \\ 3 & 0 & 3 \end{vmatrix} = 4 \cdot (-3) \cdot 3 + 2 \cdot 1 \cdot 3 + 2 \cdot (-4) \cdot 0 - 2 \cdot (-3) \cdot 3 - 2 \cdot (-4) \cdot 3 - 4 \cdot 1 \cdot 0$$

$$= -36 + 6 + 0 + 18 + 24 - 0 = 12.$$


Note that the first order determinant for the matrix $A = [a_{11}]$, consisting of one element, is equal to the value of this element: $\det[a_{11}] = a_{11}$.


2.2 Determinants of the n-th order: Minors 
The determinant of the n-th order, where $n > 2$, has the form

$$\Delta = \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix}. \tag{2.4}$$

By analogy with (2.3), it is a polynomial each summand of which is a product of exactly $n$ elements of the matrix $(a_{ij})$, with exactly one factor taken from each row and each column of this matrix.

Now let us turn to strict definitions. 

Let the ordered set of indices $(k_1, k_2, \dots, k_n)$ form some permutation of the numbers $1, 2, \dots, n$. For the first $n$ natural numbers, as is easy to see, there exist $n!$ pairwise different permutations.

An inversion in a permutation, or simply an inversion, is a pair $(k_i, k_j)$ with $i < j$ in which the greater number stands before the smaller one: $k_i > k_j$.


Example 2.3 In the collection (4, 3, 2, 1) there are six inversions: (4, 3), (4, 2), (4, 1), (3, 2), (3, 1), (2, 1); there are no other inversions in this collection.


Note The only permutation that contains no inversions is the identity permutation $(1, 2, \dots, n)$.
The determinant of the n-th order of the matrix $(a_{ij})$ is the sum

$$\det A = \sum_{\text{perm}} (-1)^{\sigma} a_{1k_1} a_{2k_2} \dots a_{nk_n}, \tag{2.5}$$

where the summation is performed over all permutations and $\sigma$ is the number of inversions in the permutation $(k_1, k_2, \dots, k_n)$. The determinant contains $n!$ terms; for $n \geq 2$, half of them are taken with a positive sign and half with a negative sign.
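Definition (2.5) translates almost literally into code. The sketch below is meant only to illustrate the definition: it enumerates all $n!$ permutations, so it is practical for small matrices only, and the function names are chosen here for illustration.

```python
from itertools import permutations


def count_inversions(perm):
    """Number of pairs in which a greater number stands before a smaller one."""
    return sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )


def det_by_permutations(A):
    """Determinant as the signed sum (2.5) over all permutations of columns."""
    n = len(A)
    total = 0
    for perm in permutations(range(n)):
        sign = -1 if count_inversions(perm) % 2 else 1
        product = 1
        for row, col in enumerate(perm):
            product *= A[row][col]
        total += sign * product
    return total


if __name__ == "__main__":
    print(count_inversions((4, 3, 2, 1)))
    print(det_by_permutations([[3, 2], [1, 5]]))
    print(det_by_permutations([[4, 2, 2], [-4, -3, 1], [3, 0, 3]]))
```

The three printed values correspond to Example 2.3, Example 2.1 and Example 2.2, respectively.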


Example 2.4 Let us expand the definition (2.5) for the case n = 2. 


Solution For $n = 2$ we have $2! = 1 \cdot 2 = 2$ permutations, namely $k_1 = 1$, $k_2 = 2$ and $k_1 = 2$, $k_2 = 1$. The sum in $\det A$ consists of two terms, $a_{11}a_{22}$ and $a_{12}a_{21}$, taken with positive and negative signs, respectively:

$$\det A = \sum_{\text{perm}} (-1)^{\sigma} a_{1k_1} a_{2k_2} = (-1)^0 a_{11}a_{22} + (-1)^1 a_{12}a_{21}. \tag{2.6}$$

The obtained formula agrees exactly with the definition (2.1) provided in Sect. 2.1.


If we remove from the matrix $A$ of the n-th order the i-th row and the j-th column, then we will obtain a matrix of the $(n-1)$-th order, whose determinant is called the complementary minor of the element $a_{ij}$ of the matrix $A$ and is denoted by $M_{ij}$.

The value $A_{ij} = (-1)^{i+j} M_{ij}$ is called the algebraic complement (cofactor) of the element $a_{ij}$ of the matrix $A$.


Example 2.5 Given the determinant

$$\Delta = \begin{vmatrix} 2 & 1 & -2 \\ 3 & -1 & 4 \\ -3 & 5 & 0 \end{vmatrix}, \tag{2.7}$$

the complementary minors of the elements $a_{23}$ and $a_{31}$ and their cofactors are equal to

$$M_{23} = \begin{vmatrix} 2 & 1 \\ -3 & 5 \end{vmatrix} = 10 + 3 = 13, \quad A_{23} = (-1)^{2+3} M_{23} = (-1) \cdot 13 = -13, \tag{2.8}$$

$$M_{31} = \begin{vmatrix} 1 & -2 \\ -1 & 4 \end{vmatrix} = 4 - 2 = 2, \quad A_{31} = (-1)^{3+1} M_{31} = 2. \tag{2.9}$$
Theorem 2.1 (Laplace¹) For the determinant of the matrix $A = (a_{ij})$ the following formulas are true:

$$\det A = \sum_{j=1}^{n} a_{ij} A_{ij}, \quad i = 1, 2, \dots, n, \tag{2.10}$$

$$\det A = \sum_{i=1}^{n} a_{ij} A_{ij}, \quad j = 1, 2, \dots, n, \tag{2.11}$$

i.e. the determinant can be expanded over any row or any column, using the cofactors of the matrix elements.


The formulas (2.10) and (2.11) allow reducing the calculation of the determinant of the n-th order to calculation of determinants of the $(n-1)$-th order. The procedure of reduction of order continues until we arrive at determinants of the second and third order, which are relatively easy to calculate.

Relations (2.10) and (2.11) are referred to as the Laplace expansions.
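The reduction of order described above can be written as a short recursive procedure. The following sketch expands along the first row, i.e. it uses (2.10) with $i = 1$; the function names are illustrative, the matrix is assumed to be a list of lists, and the running time grows like $n!$, so the code is suitable only for small matrices.

```python
def minor_matrix(A, i, j):
    """Matrix obtained from A by deleting row i and column j (0-based)."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(A) if k != i]


def det_laplace(A):
    """Determinant by Laplace expansion along the first row."""
    n = len(A)
    if n == 1:
        return A[0][0]
    return sum(
        (-1) ** j * A[0][j] * det_laplace(minor_matrix(A, 0, j))
        for j in range(n)
    )


def cofactor(A, i, j):
    """Algebraic complement of the element in row i, column j (0-based)."""
    return (-1) ** (i + j) * det_laplace(minor_matrix(A, i, j))


if __name__ == "__main__":
    A = [[2, 3, -3, 4], [2, 1, -1, 2], [6, 2, 1, 0], [2, 3, 0, -5]]
    print(det_laplace(A))
```

Because the indices in the code are 0-based, the sign $(-1)^{i+j}$ has the same parity as in the 1-based formula $A_{ij} = (-1)^{i+j} M_{ij}$.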


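Example 2.6 Let us calculate the determinant of the fourth order, expanding it along the first column:

$$\Delta = \begin{vmatrix} 2 & 3 & -3 & 4 \\ 2 & 1 & -1 & 2 \\ 6 & 2 & 1 & 0 \\ 2 & 3 & 0 & -5 \end{vmatrix} = 2 \begin{vmatrix} 1 & -1 & 2 \\ 2 & 1 & 0 \\ 3 & 0 & -5 \end{vmatrix} - 2 \begin{vmatrix} 3 & -3 & 4 \\ 2 & 1 & 0 \\ 3 & 0 & -5 \end{vmatrix} + 6 \begin{vmatrix} 3 & -3 & 4 \\ 1 & -1 & 2 \\ 3 & 0 & -5 \end{vmatrix} - 2 \begin{vmatrix} 3 & -3 & 4 \\ 1 & -1 & 2 \\ 2 & 1 & 0 \end{vmatrix}$$

$$= 2(-5 + 0 + 0 - 6 - 0 - 10) - 2(-15 + 0 + 0 - 12 - 0 - 30) + 6(15 - 18 + 0 + 12 - 0 - 15) - 2(0 - 12 + 4 + 8 - 6 - 0)$$

$$= -42 + 114 - 36 + 12 = 48.$$


¹ Pierre-Simon, marquis de Laplace (1749-1827), French mathematician, physicist and astronomer.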
2.3 General Properties of Determinants: Elementary
Transformations of a Matrix 


Let us enumerate the general properties of determinants [43, 63]. 


Property 1 Transposition of a matrix does not change its determinant:

$$\det A^T = \det A. \tag{2.12}$$

Property 2 After exchange of two rows (columns) of a matrix, the determinant changes its sign.


Property 3. The determinant of a matrix with two identical rows (columns) is equal 
to zero. 


Property 4 The determinant of a matrix with two proportional rows (columns) is 
equal to zero. 


Property 5 The determinant of a matrix will not change if the respective elements of another row (column), multiplied by the same number, are added to all elements of one of its rows (columns).


Property 6 The determinant of a product of matrices is equal to the product of determinants, i.e.

$$\det(A \cdot B) = \det A \cdot \det B. \tag{2.13}$$
Property 7 The determinant of a triangular matrix coincides with the product of 
the elements standing on the main diagonal. 


Property 8 If all the elements of some row (column) of a matrix are multiplied by 
the same number, its determinant will be multiplied by this number. 


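Property 9 If all the elements of the i-th row of a determinant are given as a sum of two terms $a_{ij} = u_j + v_j$ for $j = 1, \dots, n$, then the determinant is equal to the sum of two determinants of the following form:

$$\begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ \vdots & \vdots & & \vdots \\ u_1 + v_1 & u_2 + v_2 & \dots & u_n + v_n \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix} = \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ \vdots & \vdots & & \vdots \\ u_1 & u_2 & \dots & u_n \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix} + \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ \vdots & \vdots & & \vdots \\ v_1 & v_2 & \dots & v_n \\ \vdots & \vdots & & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix}. \tag{2.14}$$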


Property 10 If all the elements of some row or column of a matrix are equal to 
zero, then its determinant is equal to zero. 
Elementary transformations of a matrix are the transformations associated with Properties 2, 5 and 8: exchange of two rows (columns); multiplication of a row (column) by a non-zero number; addition to one row of the matrix of another row multiplied by any non-zero number (the same for the columns).

With the help of elementary transformations, a matrix may be reduced to a triangular form, and its determinant may then be easily obtained using Property 7. By Properties 2 and 8, an exchange of rows changes the sign of the determinant and a multiplication of a row by a number multiplies the determinant by this number, so such steps must be taken into account in the final answer.


Example 2.7 Calculate the determinant

$$\begin{vmatrix} 2 & 3 & -3 & 4 \\ 2 & 1 & -1 & 2 \\ 6 & 2 & 1 & 0 \\ 2 & 3 & 0 & -5 \end{vmatrix} \tag{2.15}$$

using the elementary transformations of the matrix corresponding to this determinant.


Solution

(1) Subtract the first row from the second and fourth rows, and subtract the first row multiplied by 3 from the third row. As a result we obtain

$$\begin{vmatrix} 2 & 3 & -3 & 4 \\ 2 & 1 & -1 & 2 \\ 6 & 2 & 1 & 0 \\ 2 & 3 & 0 & -5 \end{vmatrix} = \begin{vmatrix} 2 & 3 & -3 & 4 \\ 0 & -2 & 2 & -2 \\ 0 & -7 & 10 & -12 \\ 0 & 0 & 3 & -9 \end{vmatrix}; \tag{2.16}$$

(2) add to the third row the second row multiplied by $-\frac{7}{2}$:

$$\begin{vmatrix} 2 & 3 & -3 & 4 \\ 0 & -2 & 2 & -2 \\ 0 & 0 & 3 & -5 \\ 0 & 0 & 3 & -9 \end{vmatrix}; \tag{2.17}$$

(3) subtract the third row from the fourth row:

$$\begin{vmatrix} 2 & 3 & -3 & 4 \\ 0 & -2 & 2 & -2 \\ 0 & 0 & 3 & -5 \\ 0 & 0 & 0 & -4 \end{vmatrix}. \tag{2.18}$$

As a result we obtain the determinant of an upper triangular matrix, which is equal to the product of the elements positioned on the main diagonal:

$$2 \cdot (-2) \cdot 3 \cdot (-4) = 48. \tag{2.19}$$
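The same reduction to a triangular form can be automated. The following is a minimal sketch, assuming integer or rational entries and using exact arithmetic from the standard `fractions` module; the function name is illustrative. It uses only row exchanges (Property 2, which flip the sign) and additions of a multiple of one row to another (Property 5), and then applies Property 7.

```python
from fractions import Fraction


def det_by_elimination(A):
    """Determinant via reduction to an upper triangular matrix."""
    M = [[Fraction(x) for x in row] for row in A]
    n = len(M)
    sign = 1
    for col in range(n):
        # Find a row at or below the diagonal with a non-zero entry in this column.
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)  # the whole column below the diagonal is zero
        if pivot != col:
            M[col], M[pivot] = M[pivot], M[col]
            sign = -sign  # Property 2: an exchange changes the sign
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            # Property 5: subtracting a multiple of a row keeps the determinant.
            M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    result = Fraction(sign)
    for i in range(n):
        result *= M[i][i]  # Property 7: product of the diagonal elements
    return result


if __name__ == "__main__":
    A = [[2, 3, -3, 4], [2, 1, -1, 2], [6, 2, 1, 0], [2, 3, 0, -5]]
    print(det_by_elimination(A))
```

Unlike the expansion by cofactors, this procedure needs on the order of $n^3$ arithmetic operations.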


Note Linear algebra also offers alternative ways to introduce the value $\det A$, in particular the axiomatic definition [63].


2.4 Inverse Matrix 


Let $A$ be a square matrix of size $n \times n$, and let $I$ be the identity matrix of the same size. The matrix $B$ is called the inverse of the matrix $A$ if the following equalities are fulfilled:

$$A \cdot B = B \cdot A = I.$$

The inverse of $A$ is denoted as $A^{-1}$.


Note that not every matrix has an inverse.


Theorem 2.2 If the determinant of the matrix $A$ is equal to zero, i.e. $\det A = 0$, then the inverse matrix $A^{-1}$ does not exist.


A square matrix is referred to as nonsingular (or nondegenerate) if it has an inverse matrix. Otherwise, $A$ is a singular (degenerate) matrix. It is known that the inverse of a nonsingular matrix is unique.


Theorem 2.3 If the determinant of the matrix $A = (a_{ij})$ is other than zero, i.e. $\Delta = \det A \neq 0$, then the inverse matrix exists:

$$A^{-1} = \frac{1}{\Delta} (A_{ij})^T, \tag{2.20}$$

where $(A_{ij})$ is the matrix formed by the cofactors of the elements $a_{ij}$ of the matrix $A$ (cofactor matrix).


Note The matrix transposed to $(A_{ij})$ is called the adjugate, or classical adjoint, of the original matrix [15, 35, 64, 65].


The following statements are true:


1. If the matrix $A$ is invertible, then $A^T$ is also invertible, and $(A^T)^{-1} = (A^{-1})^T$.
2. If the matrices $A$ and $B$ are invertible, then $(AB)^{-1} = B^{-1}A^{-1}$.
3. The inverse of an upper (lower) triangular matrix is also an upper (lower) triangular matrix.
4. If $A^{-1}$ exists, then $\det(A^{-1}) = \dfrac{1}{\det A}$.


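Example 2.8 Find the inverse of the matrix

$$A = \begin{bmatrix} 2 & 5 & 7 \\ 6 & 3 & 4 \\ 5 & -2 & -3 \end{bmatrix}. \tag{2.21}$$


Solution The determinant of the matrix is equal to

$$\det A = 2 \begin{vmatrix} 3 & 4 \\ -2 & -3 \end{vmatrix} - 5 \begin{vmatrix} 6 & 4 \\ 5 & -3 \end{vmatrix} + 7 \begin{vmatrix} 6 & 3 \\ 5 & -2 \end{vmatrix} = 2(-9 + 8) - 5(-18 - 20) + 7(-12 - 15) = -1. \tag{2.22}$$

Calculate the cofactors:

$$A_{11} = (-1)^{1+1} \begin{vmatrix} 3 & 4 \\ -2 & -3 \end{vmatrix} = -1, \quad A_{12} = (-1)^{1+2} \begin{vmatrix} 6 & 4 \\ 5 & -3 \end{vmatrix} = 38, \tag{2.23}$$

$$A_{13} = (-1)^{1+3} \begin{vmatrix} 6 & 3 \\ 5 & -2 \end{vmatrix} = -27, \quad A_{21} = (-1)^{2+1} \begin{vmatrix} 5 & 7 \\ -2 & -3 \end{vmatrix} = 1, \tag{2.24}$$

$$A_{22} = (-1)^{2+2} \begin{vmatrix} 2 & 7 \\ 5 & -3 \end{vmatrix} = -41, \quad A_{23} = (-1)^{2+3} \begin{vmatrix} 2 & 5 \\ 5 & -2 \end{vmatrix} = 29, \tag{2.25}$$

$$A_{31} = (-1)^{3+1} \begin{vmatrix} 5 & 7 \\ 3 & 4 \end{vmatrix} = -1, \quad A_{32} = (-1)^{3+2} \begin{vmatrix} 2 & 7 \\ 6 & 4 \end{vmatrix} = 34, \tag{2.26}$$

$$A_{33} = (-1)^{3+3} \begin{vmatrix} 2 & 5 \\ 6 & 3 \end{vmatrix} = -24. \tag{2.27}$$

Write the matrix formed by the cofactors:

$$\begin{bmatrix} -1 & 38 & -27 \\ 1 & -41 & 29 \\ -1 & 34 & -24 \end{bmatrix}. \tag{2.28}$$

As a result, the desired inverse matrix will have the form:

$$A^{-1} = \frac{1}{-1} \begin{bmatrix} -1 & 1 & -1 \\ 38 & -41 & 34 \\ -27 & 29 & -24 \end{bmatrix} = \begin{bmatrix} 1 & -1 & 1 \\ -38 & 41 & -34 \\ 27 & -29 & 24 \end{bmatrix}. \tag{2.29}$$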
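Formula (2.20) can be turned into a short program. The sketch below assumes a square matrix with integer entries and returns exact fractions; the helper names are illustrative, and the determinant is computed by cofactor expansion, so the code is intended for small matrices only.

```python
from fractions import Fraction


def minor_matrix(A, i, j):
    """Matrix obtained from A by deleting row i and column j (0-based)."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(A) if k != i]


def det(A):
    """Determinant by expansion along the first row (empty matrix gives 1)."""
    n = len(A)
    if n == 0:
        return 1
    return sum((-1) ** j * A[0][j] * det(minor_matrix(A, 0, j)) for j in range(n))


def inverse_by_cofactors(A):
    """Inverse matrix as the transposed cofactor matrix divided by det A."""
    n = len(A)
    d = det(A)
    if d == 0:
        raise ValueError("matrix is singular, the inverse does not exist")
    cof = [
        [(-1) ** (i + j) * det(minor_matrix(A, i, j)) for j in range(n)]
        for i in range(n)
    ]
    # Transposition: element (i, j) of the inverse uses the cofactor (j, i).
    return [[Fraction(cof[j][i], d) for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    A = [[2, 5, 7], [6, 3, 4], [5, -2, -3]]
    for row in inverse_by_cofactors(A):
        print([str(x) for x in row])
```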


2.5 Integer Powers of a Matrix 


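The notion of raising a number to an integer power is easily generalized to matrix algebra. By definition we have

$$A^0 = I, \quad A^1 = A, \quad A^2 = AA, \quad A^3 = AAA, \quad \dots, \tag{2.30}$$

and if $A$ is a nondegenerate matrix, then

$$A^{-p} = (A^{-1})^p = (A^p)^{-1}. \tag{2.31}$$

For diagonal matrices, the p-th power preserves the property of diagonality:

$$\begin{bmatrix} d_1 & 0 & \dots & 0 \\ 0 & d_2 & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & d_n \end{bmatrix}^p = \begin{bmatrix} d_1^p & 0 & \dots & 0 \\ 0 & d_2^p & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & d_n^p \end{bmatrix} \tag{2.32}$$

for all integer $p$ (for negative $p$ all the elements $d_i$ must be non-zero).
In order to solve the following problems, we will need the mathematical induction method.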
2.5.1 Mathematical Induction Method


For proving statements that depend on a natural parameter, mathematics widely uses the mathematical induction method [1, 17, 66] (from Latin inductio, derivation).


Mathematical Induction Principle Let $P(n)$ be a statement defined for all natural numbers $n$, and let the following conditions be fulfilled:


(1) $P(1)$ is true;
(2) for all $k \geq 1$ the logical implication $P(k) \Rightarrow P(k+1)$ is true.
Then $P(n)$ is true for any natural $n$.


Condition 1 is usually referred to as the basis step, and condition 2 as the inductive step.

In order to prove identities by the mathematical induction method, the following is done. Let the statement $P(k)$ take the true value when the considered identity is true for the natural number $k$. Then, two statements are proved:


(1) basis step, i.e. $P(1)$;
(2) inductive step, i.e. $P(k) \Rightarrow P(k+1)$ for an arbitrary $k \geq 1$.


By the mathematical induction principle, one then concludes that the considered identity is true for all natural values of $n$.
Note that in mathematical logic a statement of the form $P(n)$ is called a predicate.


Example 2.9 Using the mathematical induction method, we will prove the statement: $1 + 3 + 5 + \dots + (2n - 1) = n^2$ for all natural numbers $n$.


Proof Let $P(n)$ be the predicate “$1 + 3 + 5 + \dots + (2n - 1) = n^2$”.

Basis step

For $n = 1$ we obtain $1 = 1^2$, i.e. $P(1)$ is true.

Inductive step

Assume that for $n = k$ the statement $1 + 3 + 5 + \dots + (2k - 1) = k^2$ is true. Prove the validity of $P(k+1)$:

$$1 + 3 + 5 + \dots + (2k - 1) + (2(k+1) - 1) = k^2 + (2(k+1) - 1) = k^2 + 2k + 1 = (k+1)^2.$$

Thus, for any natural $k$ the implication $P(k) \Rightarrow P(k+1)$ is true. Then, by the principle of mathematical induction, the predicate $P(n)$ has a true value for all natural $n$.


Example 2.10 Relying on the mathematical induction method, prove that $n^2 + n$ is even for all natural $n$.

Proof Denote $f(n) = n^2 + n$ and let $P(n)$ be the predicate “$f(n)$ is divisible by 2”.
Basis step
For $n = 1$ we obtain $f(1) = 1^2 + 1 = 2$, an even number, which is why $P(1)$ is true.
Inductive step
Assume that $f(k)$ is even for a natural $k \geq 1$. Let us prove that this implies evenness of $f(k+1)$:

$$f(k+1) = (k+1)^2 + (k+1) = k^2 + 2k + 1 + k + 1 = (k^2 + k) + 2k + 2 = f(k) + 2(k+1).$$

Since the right side of the obtained relation is the sum of two even numbers, $f(k+1)$ is divisible by 2.


Note The statement of the example becomes apparent if we represent the expression $k^2 + k$ in the form $k^2 + k = k(k+1)$. Out of two consecutive natural numbers, one is necessarily even, and so their product is divisible by two.


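Example 2.11 Calculate the hundredth power of the matrix $A = \begin{bmatrix} a & c \\ 0 & a \end{bmatrix}$.

Let us find several lower powers of this matrix:

$$A^2 = \begin{bmatrix} a & c \\ 0 & a \end{bmatrix} \begin{bmatrix} a & c \\ 0 & a \end{bmatrix} = \begin{bmatrix} a^2 & 2ac \\ 0 & a^2 \end{bmatrix}, \quad A^3 = \begin{bmatrix} a^3 & 3a^2c \\ 0 & a^3 \end{bmatrix}.$$

An assumption arises that for all natural values of $n$ the following equality is true:

$$\begin{bmatrix} a & c \\ 0 & a \end{bmatrix}^n = \begin{bmatrix} a^n & na^{n-1}c \\ 0 & a^n \end{bmatrix}.$$

In order to verify this assumption, let us use the mathematical induction method. Prove that for all natural $n$ the equality is fulfilled:

$$A^n = \begin{bmatrix} a^n & na^{n-1}c \\ 0 & a^n \end{bmatrix}. \tag{2.34}$$

Proof Denote the predicate “$A^n = \begin{bmatrix} a^n & na^{n-1}c \\ 0 & a^n \end{bmatrix}$” by $P(n)$.


Basis step

Consider the case $n = 1$. The equality takes the form $A^1 = A$, which is a true statement.

Inductive step

Assume that $P(k)$ for some $k = 1, 2, \dots$ takes the true value, i.e. $A^k = \begin{bmatrix} a^k & ka^{k-1}c \\ 0 & a^k \end{bmatrix}$. Prove that $P(k+1)$ is true.

Write the predicate $P(k+1)$ in the form:

$$A^{k+1} = \begin{bmatrix} a^{k+1} & (k+1)a^kc \\ 0 & a^{k+1} \end{bmatrix}. \tag{2.35}$$

Let us use the inductive supposition and rewrite the power $A^{k+1}$ as $A^kA$:

$$A^{k+1} = \begin{bmatrix} a^k & ka^{k-1}c \\ 0 & a^k \end{bmatrix} \begin{bmatrix} a & c \\ 0 & a \end{bmatrix} = \begin{bmatrix} a^k \cdot a & a^kc + ka^{k-1}c \cdot a \\ 0 & a^k \cdot a \end{bmatrix}.$$

After algebraic transformations we obtain

$$A^{k+1} = \begin{bmatrix} a^{k+1} & (k+1)a^kc \\ 0 & a^{k+1} \end{bmatrix}.$$

This expression coincides with (2.35). Then, for any $k = 1, 2, \dots$ the implication $P(k) \Rightarrow P(k+1)$ is true, and by the mathematical induction method $A^n = \begin{bmatrix} a^n & na^{n-1}c \\ 0 & a^n \end{bmatrix}$ for all natural $n$.

Substituting in (2.34) the value $n = 100$, we finally obtain

$$A^{100} = \begin{bmatrix} a^{100} & 100a^{99}c \\ 0 & a^{100} \end{bmatrix}.$$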

The operation of raising a matrix to a power can be executed by a relatively small number of calculations if the original matrix is presentable in the form

$$B = U^{-1}DU, \tag{2.36}$$

where $D$ is a diagonal matrix and $U$ is a nonsingular matrix: $UU^{-1} = I$. We will carry out the calculation of $B^p = (U^{-1}DU)^p$ relying on the associativity property of the multiplication operation (see page 8):

$$B^p = (U^{-1}DU)^p = \underbrace{(U^{-1}DU)(U^{-1}DU) \dots (U^{-1}DU)}_{p \text{ times}} = U^{-1}D(UU^{-1})D(UU^{-1})D \dots D(UU^{-1})DU$$

$$= U^{-1}DIDID \dots DIDU = U^{-1}\underbrace{(DD \dots D)}_{p \text{ times}}U = U^{-1}D^pU. \tag{2.37}$$

The rule of raising the matrix $B$ to a power is generalized to the case of a negative integer $p = -|p|$, provided that $D$ is nonsingular:

$$B^{p} = B^{-|p|} = (B^{-1})^{|p|} = \left((U^{-1}DU)^{-1}\right)^{|p|} = (U^{-1}D^{-1}U)^{|p|} = U^{-1}(D^{-1})^{|p|}U = U^{-1}D^{p}U. \tag{2.38}$$

Therefore, the above reasoning proves the theorem on raising matrices $B$ of the special form (2.36) to an integer power.


Theorem 2.4 (Matrix Power Theorem) For an arbitrary matrix $B$ presentable in the form $B = U^{-1}DU$ and an integer number $p$, the following equality is true:

$$B^p = U^{-1}D^pU. \tag{2.39}$$
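A small numerical illustration of (2.39) follows. It is a sketch under explicit assumptions: the matrices $U$ and $U^{-1}$ and the diagonal elements below are made up for the example, the diagonal of $D$ is passed as a plain list, and exact fractions are used so that negative powers are handled without rounding.

```python
from fractions import Fraction


def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]


def power_via_diagonal(U_inv, d, U, p):
    """B^p for B = U^{-1} diag(d) U; a negative p needs all d[i] non-zero."""
    n = len(d)
    D_p = [
        [Fraction(d[i]) ** p if i == j else Fraction(0) for j in range(n)]
        for i in range(n)
    ]
    return matmul(matmul(U_inv, D_p), U)


if __name__ == "__main__":
    U = [[1, 1], [0, 1]]        # hypothetical nonsingular matrix
    U_inv = [[1, -1], [0, 1]]   # its inverse, so that U * U_inv = I
    d = [2, 3]                  # diagonal elements of D
    B = power_via_diagonal(U_inv, d, U, 1)
    print(B)                                   # the matrix B itself
    print(power_via_diagonal(U_inv, d, U, 3))  # B cubed
    print(matmul(B, power_via_diagonal(U_inv, d, U, -1)))  # identity
```

Only the $n$ diagonal elements are raised to the power $p$; the two matrix products do not depend on $p$.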
Example 2.12 Prove that integer powers of a matrix commute:

$$A^{p_1}A^{p_2} = A^{p_2}A^{p_1}. \tag{2.40}$$


Proof Indeed, based on the properties of the operation of raising to a power, we have

$$A^{p_1}A^{p_2} = A^{p_1 + p_2} = A^{p_2 + p_1} = A^{p_2}A^{p_1}.$$

2.6 Functions of Matrices 


A matrix may act as an argument of some function. Let us begin with consideration of a polynomial of a matrix. As is known, a polynomial of degree $p$ in the variable $x$ is the sum $f(x)$ of the form:

$$f(x) = c_0 + c_1x + c_2x^2 + \dots + c_px^p, \tag{2.41}$$

where $c_i$ ($i = 0, 1, \dots, p$) are arbitrary numerical coefficients.
By the polynomial $f(A)$ of degree $p$ of the matrix $A$ we will understand the following expression:

$$f(A) = c_0I + c_1A + c_2A^2 + \dots + c_pA^p. \tag{2.42}$$

As is easy to see, the value of the function $f(A)$ is, in its turn, some matrix as well. Its elements are expressed by the formulas:

$$(f(A))_{ij} = c_0\delta_{ij} + c_1(A)_{ij} + c_2(A^2)_{ij} + \dots + c_p(A^p)_{ij}. \tag{2.43}$$
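The polynomial (2.42) can be evaluated without computing each power of $A$ separately. The sketch below uses Horner's scheme, which is a standard technique and is not taken from the textbook; the function names are illustrative, and the coefficients are assumed to be given as a list `[c0, c1, ..., cp]`.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]


def matrix_polynomial(coeffs, A):
    """Value of c0*I + c1*A + ... + cp*A^p for a square matrix A."""
    n = len(A)
    result = [[0] * n for _ in range(n)]
    # Horner's scheme: result <- result * A + c * I, from the highest coefficient.
    for c in reversed(coeffs):
        result = matmul(result, A)
        for i in range(n):
            result[i][i] += c
    return result


if __name__ == "__main__":
    A = [[1, 1], [0, 1]]
    # f(x) = 1 + 2x + x^2
    print(matrix_polynomial([1, 2, 1], A))
```

With this scheme a polynomial of degree $p$ costs $p$ matrix multiplications.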
Using the theorem about the power of a matrix of the special form (2.39), we will write the relation

$$f(U^{-1}AU) = U^{-1}f(A)U, \tag{2.44}$$

which is true for an arbitrary nonsingular matrix $U$ (in particular, for an orthogonal one).
A fractionally rational function of a matrix is the value

$$\frac{f_1(A)}{f_2(A)} = f_1(A)(f_2(A))^{-1} = (f_2(A))^{-1}f_1(A) \tag{2.45}$$

if $\det f_2(A) \neq 0$.

Let us demonstrate the correctness of the introduced definition, i.e. that the two products in (2.45) are always equal to each other. Indeed, having multiplied both parts of the equality $f_1(A)(f_2(A))^{-1} = (f_2(A))^{-1}f_1(A)$ by $f_2(A)$ on the right and then by $f_2(A)$ on the left, we obtain

$$f_2(A)f_1(A) = f_1(A)f_2(A). \tag{2.46}$$

The polynomials $f_1(A)$ and $f_2(A)$ depend only on the matrix $A$ and, therefore, commute. Due to this, the fraction $\dfrac{f_1(A)}{f_2(A)}$ is defined correctly [53].

2.6.1 Exponent and Logarithm 


The infinite sum of matrices of size $m \times n$ of the form

$$A + B + C + \dots + Z + \dots \tag{2.47}$$

is called a series. It is said that the series converges if for all $i = 1, 2, \dots, m$ and $j = 1, 2, \dots, n$ the series of the respective components of these matrices converge [53]:

$$a_{ij} + b_{ij} + c_{ij} + \dots + z_{ij} + \dots \tag{2.48}$$

By definition, the exponent $e^A$ of the square matrix $A$ is set equal to the sum

$$e^A = I + A + \frac{1}{2!}A^2 + \frac{1}{3!}A^3 + \dots + \frac{1}{p!}A^p + \dots \tag{2.49}$$

It is known that this series converges for any real square matrix [37]. The matrix $I$ in the formula (2.49) is taken of the same order as $A$.
In a compact form, this sum can be presented as

$$e^A = \sum_{p=0}^{\infty} \frac{A^p}{p!}. \tag{2.50}$$

Sometimes, especially when writing formulas with fractions, another notation for the matrix exponent is used, namely $\exp A$.


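Example 2.13 Calculate $\exp \begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}$, where $\lambda \in \mathbb{R}$.


Solution As follows from the formula (2.34) in Example 2.11, for all natural $p$ the equality $\begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}^p = \begin{bmatrix} 1 & p\lambda \\ 0 & 1 \end{bmatrix}$ is fulfilled. Denote $A = \begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}$.

Using the definition (2.49), we obtain

$$e^A = \sum_{p=0}^{\infty} \frac{A^p}{p!} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} + \begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix} + \frac{1}{2!}\begin{bmatrix} 1 & 2\lambda \\ 0 & 1 \end{bmatrix} + \frac{1}{3!}\begin{bmatrix} 1 & 3\lambda \\ 0 & 1 \end{bmatrix} + \dots$$

$$= \begin{bmatrix} \sum\limits_{p=0}^{\infty} \dfrac{1}{p!} & \lambda\sum\limits_{p=1}^{\infty} \dfrac{1}{(p-1)!} \\ 0 & \sum\limits_{p=0}^{\infty} \dfrac{1}{p!} \end{bmatrix} = \begin{bmatrix} e & \lambda e \\ 0 & e \end{bmatrix} = e\begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}. \tag{2.51}$$

During calculation of the elements of the matrix $e^A$ we used the definition of the number $e = 2.71828\dots$ (the base of natural logarithms), known from the course of mathematical analysis [76]:

$$e = \sum_{p=0}^{\infty} \frac{1}{p!} = 1 + \frac{1}{1!} + \frac{1}{2!} + \frac{1}{3!} + \dots \tag{2.52}$$
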

Note The notion of the matrix exponent is widely used in the theory of systems of 
differential equations [3]. 
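The series (2.49) can be summed numerically by cutting it off after a fixed number of terms. The sketch below works in floating point; the function names and the default of 30 terms are assumptions made for this illustration, and a plain truncated series is adequate only for matrices with moderately sized elements.

```python
import math


def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]


def exp_series(A, terms=30):
    """Partial sum of I + A + A^2/2! + ... with the given number of terms."""
    n = len(A)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]  # current term A^p / p!, starting at p = 0
    for p in range(1, terms):
        term = [[x / p for x in row] for row in matmul(term, A)]
        result = [
            [result[i][j] + term[i][j] for j in range(n)] for i in range(n)
        ]
    return result


if __name__ == "__main__":
    lam = 2.0
    E = exp_series([[1.0, lam], [0.0, 1.0]])
    print(E)
    print(math.e, lam * math.e)  # values expected from Example 2.13
```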


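The logarithm of the square matrix $A$ is the sum

$$\ln A = (A - I) - \frac{1}{2}(A - I)^2 + \frac{1}{3}(A - I)^3 - \dots + \frac{(-1)^{p+1}}{p}(A - I)^p + \dots, \tag{2.53}$$

if this series converges. As in the case of the matrix exponent, the identity matrix $I$ in the formula (2.53) must have the same order as $A$. In a compact form, the sum (2.53) can be presented as follows:

$$\ln A = \sum_{p=1}^{\infty} \frac{(-1)^{p+1}}{p}(A - I)^p.$$


Example 2.14 Calculate $\ln \begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}$, where $\lambda \in \mathbb{R}$.


Solution Denote $A = \begin{bmatrix} 1 & \lambda \\ 0 & 1 \end{bmatrix}$. We will need the natural powers of the difference of the matrices $A - I = \begin{bmatrix} 0 & \lambda \\ 0 & 0 \end{bmatrix}$.

Since $(A - I)^2 = \begin{bmatrix} 0 & \lambda \\ 0 & 0 \end{bmatrix}\begin{bmatrix} 0 & \lambda \\ 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}$, all the summands of the series in (2.53), except the first one, are in this case equal to the zero matrix. Therefore, the value $\ln A$ is fully defined by the first summand of the series: $\ln A = \begin{bmatrix} 0 & \lambda \\ 0 & 0 \end{bmatrix}$.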
Theorem 2.5 For the determinant of an arbitrary matrix A, the formula 
det A = exp(tr In A) (2.54) 


is valid. 


Note Many examples of using various functions of matrices for solving practical 
problems are considered in [34]. 


2.7 Matrix Rank 


A minor of the k-th order $M^{i_1, i_2, \dots, i_k}_{j_1, j_2, \dots, j_k}$ of the matrix $A$ of size $m \times n$ is the determinant of the matrix formed by the elements standing at the intersection of the selected rows with numbers $i_1, i_2, \dots, i_k$ and columns with numbers $j_1, j_2, \dots, j_k$, on condition that $1 \leq i_1 < i_2 < \dots < i_k \leq m$ and $1 \leq j_1 < j_2 < \dots < j_k \leq n$.

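Example 2.15 Given the square matrix

$$A = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}, \tag{2.55}$$

for which the number of rows $m$ and the number of columns $n$ are equal: $m = n = 3$.
Then the second order minors of the matrix $A$ have the form

$$M^{1,2}_{1,2} = \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}, \quad M^{1,2}_{1,3} = \begin{vmatrix} a_{11} & a_{13} \\ a_{21} & a_{23} \end{vmatrix}, \tag{2.56}$$

$$M^{1,3}_{1,2} = \begin{vmatrix} a_{11} & a_{12} \\ a_{31} & a_{32} \end{vmatrix}, \quad M^{1,3}_{1,3} = \begin{vmatrix} a_{11} & a_{13} \\ a_{31} & a_{33} \end{vmatrix}, \tag{2.57}$$

and so on until the minor

$$M^{2,3}_{2,3} = \begin{vmatrix} a_{22} & a_{23} \\ a_{32} & a_{33} \end{vmatrix}. \tag{2.58}$$

All in all, nine second order minors can be formed for a matrix of size $3 \times 3$ (for the number of minors of an arbitrary order, see the solution of Problem 2.18).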

The number r is called the rank of the matrix A, if there exists a minor of order 
r, other than zero, and all minors of greater order are equal to zero. 

Any minor of maximal order r, other than zero, is referred to as basic minor. 

For the rank of the matrix A the designation rk A is used. 
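The definition of the rank can be applied literally by searching through the minors, starting from the largest possible order. The sketch below does exactly that; it is meant to illustrate the definition rather than to be efficient, the function names are illustrative, and exact (integer) entries are assumed so that the comparison with zero is reliable.

```python
from itertools import combinations


def det(A):
    """Determinant by expansion along the first row (empty matrix gives 1)."""
    n = len(A)
    if n == 0:
        return 1
    return sum(
        (-1) ** j * A[0][j] * det([row[:j] + row[j + 1:] for row in A[1:]])
        for j in range(n)
    )


def rank_by_minors(A):
    """Largest order r for which some minor of order r is non-zero."""
    m, n = len(A), len(A[0])
    for r in range(min(m, n), 0, -1):
        for rows in combinations(range(m), r):
            for cols in combinations(range(n), r):
                sub = [[A[i][j] for j in cols] for i in rows]
                if det(sub) != 0:
                    return r
    return 0  # all elements are zero


if __name__ == "__main__":
    print(rank_by_minors([[3, 5, 7], [1, 2, 3], [1, 3, 5]]))
    print(rank_by_minors([[0, 0], [0, 0]]))
```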

In the following presentation, we need the concept of linear dependence of matrix rows. Consider $k$ rows of the form

$$U_1 = [u_{11}\ u_{12}\ \dots\ u_{1n}],$$
$$U_2 = [u_{21}\ u_{22}\ \dots\ u_{2n}], \tag{2.59}$$
$$\dots$$
$$U_k = [u_{k1}\ u_{k2}\ \dots\ u_{kn}],$$

each of which contains $n$ real numbers.

We multiply every element of the first row by a real number $\alpha_1$, every element of the second row by $\alpha_2$, etc. Then we add the corresponding elements of the rows. As a result, a new row $W$ forms:

$$W = \alpha_1U_1 + \alpha_2U_2 + \dots + \alpha_kU_k = \alpha_1[u_{11}\ u_{12}\ \dots\ u_{1n}] + \alpha_2[u_{21}\ u_{22}\ \dots\ u_{2n}] + \dots + \alpha_k[u_{k1}\ u_{k2}\ \dots\ u_{kn}]. \tag{2.60}$$

The row $W$ is called a linear combination of the rows $U_i$ ($i = 1, 2, \dots, k$). The numbers $\alpha_i$, where $i = 1, 2, \dots, k$, are said to be the coefficients of the linear combination of rows.

If there is a collection of real numbers $\alpha_1, \alpha_2, \dots, \alpha_k$, among which at least one is not equal to zero, such that $W$ is the zero row, then $U_1, U_2, \dots, U_k$ are called linearly dependent. Otherwise, these rows are said to be linearly independent.

The definition of linear dependence and independence of matrix columns is introduced similarly.

Example 2.16 The rows $U_1 = [1\ \ 0\ \ {-3}]$, $U_2 = [3\ \ {-2}\ \ 1]$, $U_3 = [5\ \ {-2}\ \ {-5}]$ are linearly dependent, since there is a linear combination of them equal to the zero row:

$$2U_1 + U_2 - U_3 = [0\ \ 0\ \ 0]. \tag{2.61}$$


Theorem 2.6 (Basic Minor Theorem) The maximal number of linearly independent rows of a matrix coincides with the maximal number of its linearly independent columns and equals the order of the basic minor. The rows (columns) included into the basic minor are linearly independent, and the remaining rows (columns) are linearly expressed through them.


Note A zero matrix has no linearly independent rows. This is why the rank of the 
matrix formed by zero elements will be considered as equal to zero by definition 
[36, 47]. 


The elementary transformations do not change the rank of a matrix. 
Recall that the elementary transformations of a matrix are (see Sect. 2.3): 


(a) exchange of two rows (columns); 

(b) multiplication of a row (column) by a non-zero number; 

(c) addition to one row of the matrix of another row multiplied by any non-zero number (the same for the columns).


When calculating the rank of a matrix, one may also delete a zero row (column) or one of two proportional rows (columns); this does not change the rank of the matrix.

A matrix is referred to as an echelon matrix if each of its rows begins with a strictly greater number of zeroes than the previous row.

One of the basic methods of finding the rank of a matrix is the method of elementary transformations.

The elementary transformations method allows bringing a matrix to the echelon form with the help of the following algorithm:


(1) Select a row whose first element is non-zero. This row is written first and is called the pivot row.
(2) To each of the remaining rows the pivot row is added, multiplied by

$$-\frac{a_{i1}}{a_{11}} \tag{2.62}$$

(here $i$ is the number of the row to which the pivot row is added). As a result, in all rows except the pivot one the first elements will be equal to zero.

(3) Out of the remaining rows, the one is selected whose second element is not equal to zero. It is written second and is considered to be the pivot one.

(4) To each of the remaining rows the pivot row is added, multiplied by

$$-\frac{a_{i2}}{a_{22}}. \tag{2.63}$$

As a result, all elements of the second column become zero, except those in the first and the second rows.


This process continues until an echelon matrix is obtained. The number of non-zero rows in this matrix will be its rank.
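The algorithm above can be sketched in code as follows. This is an illustration under stated assumptions rather than a literal transcription: exact fractions are used, the function name is hypothetical, and when the current column has no non-zero element below the rows already processed, the sketch simply moves on to the next column, a case the description above does not spell out.

```python
from fractions import Fraction


def rank_by_elimination(A):
    """Rank as the number of non-zero rows after reduction to echelon form."""
    M = [[Fraction(x) for x in row] for row in A]
    n_rows, n_cols = len(M), len(M[0])
    rank = 0  # number of pivot rows found so far
    for col in range(n_cols):
        # Look for a row, not yet used as a pivot, with a non-zero element here.
        pivot = next((r for r in range(rank, n_rows) if M[r][col] != 0), None)
        if pivot is None:
            continue
        M[rank], M[pivot] = M[pivot], M[rank]  # write the pivot row next
        for r in range(rank + 1, n_rows):
            factor = M[r][col] / M[rank][col]
            # Add the pivot row multiplied by -factor, zeroing the element.
            M[r] = [x - factor * y for x, y in zip(M[r], M[rank])]
        rank += 1
        if rank == n_rows:
            break
    return rank


if __name__ == "__main__":
    print(rank_by_elimination([[3, 5, 7], [1, 2, 3], [1, 3, 5]]))
```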


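Example 2.17 Find the rank of the matrix using the method of elementary transformations:

$$A = \begin{bmatrix} 3 & 5 & 7 \\ 1 & 2 & 3 \\ 1 & 3 & 5 \end{bmatrix}. \tag{2.64}$$

The third order minor is the determinant of the matrix: $M^{1,2,3}_{1,2,3} = \det A$. Calculate the determinant of the matrix $A$ by expansion along the first column:

$$\det A = 3 \cdot (-1)^{1+1}\begin{vmatrix} 2 & 3 \\ 3 & 5 \end{vmatrix} + 1 \cdot (-1)^{2+1}\begin{vmatrix} 5 & 7 \\ 3 & 5 \end{vmatrix} + 1 \cdot (-1)^{3+1}\begin{vmatrix} 5 & 7 \\ 2 & 3 \end{vmatrix}$$

$$= 3 \cdot (10 - 9) - 1 \cdot (25 - 21) + 1 \cdot (15 - 14) = 3 - 4 + 1 = 0.$$

Consider the second order minor

$$M^{1,2}_{1,2} = \begin{vmatrix} 3 & 5 \\ 1 & 2 \end{vmatrix} = 1 \neq 0. \tag{2.65}$$

Since it is not equal to zero, $\operatorname{rk} A = 2$.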


It is often difficult to calculate the matrix rank based on its definition because one has to search through a great number of minors. As a rule, in order to simplify the calculation of the rank, the matrix is reduced to a simpler form using the elementary transformations (see Sect. 2.3).

So, the rank of a matrix presented in the echelon form is equal to the number of its non-zero rows.

In what follows, we will use the following designations for the equivalent transformations:


• $A \to B$: the matrix $B$ is obtained as a result of an elementary transformation of the matrix $A$;

• $(i) + a(j)$: addition to the i-th row of the matrix of the row with number $j$, multiplied by the constant $a$;

• $(i) - a(j)$: subtraction from the i-th row of the matrix of the row with number $j$, multiplied by the constant $a$;

• $(i) \leftrightarrow (j)$: exchange of two rows.


Similar designations will also be used for operations with columns, and the column with number $j$ will be denoted by $[j]$.


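Example 2.18 Find the rank of the matrix

$$ \begin{pmatrix} 3 & -1 & 3 & 2 & 5 \\ 5 & -3 & 2 & 3 & 4 \\ 1 & -3 & -5 & 0 & -7 \\ 7 & -5 & 1 & 4 & 1 \end{pmatrix}. \tag{2.66} $$

Solution Swap the first and the third rows:

$$ \begin{pmatrix} 1 & -3 & -5 & 0 & -7 \\ 5 & -3 & 2 & 3 & 4 \\ 3 & -1 & 3 & 2 & 5 \\ 7 & -5 & 1 & 4 & 1 \end{pmatrix}. \tag{2.67} $$

To the second row add the first row multiplied by $(-5)$, to the third row add the first row multiplied by $(-3)$, and to the fourth row add the first row multiplied by $(-7)$. As a result we obtain

$$ \begin{pmatrix} 1 & -3 & -5 & 0 & -7 \\ 0 & 12 & 27 & 3 & 39 \\ 0 & 8 & 18 & 2 & 26 \\ 0 & 16 & 36 & 4 & 50 \end{pmatrix}. \tag{2.68} $$

Divide the second row by 3, and the third and fourth rows by 2:

$$ \begin{pmatrix} 1 & -3 & -5 & 0 & -7 \\ 0 & 4 & 9 & 1 & 13 \\ 0 & 4 & 9 & 1 & 13 \\ 0 & 8 & 18 & 2 & 25 \end{pmatrix}. \tag{2.69} $$

Then subtract the second row from the third one, and the doubled second row from the fourth one; finally, swap the third and the fourth rows:

$$ \begin{pmatrix} 1 & -3 & -5 & 0 & -7 \\ 0 & 4 & 9 & 1 & 13 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & -1 \end{pmatrix} \sim \begin{pmatrix} 1 & -3 & -5 & 0 & -7 \\ 0 & 4 & 9 & 1 & 13 \\ 0 & 0 & 0 & 0 & -1 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix}. \tag{2.70} $$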


We have obtained the echelon form of the matrix. The number of non-zero rows 
is three and therefore the rank of this matrix is three. 


There is one more method for calculating the rank of a matrix. It is referred to as the bordering minor method and consists in the following. To compute rk $A$, the minors are computed consecutively, passing from lower-order minors to higher-order ones. If a non-zero minor of order $r$ has already been found,

$$ M = \begin{vmatrix} a_{11} & \dots & a_{1r} \\ \vdots & \ddots & \vdots \\ a_{r1} & \dots & a_{rr} \end{vmatrix} \neq 0, \tag{2.71} $$

then it is enough to compute only the minors of order $(r + 1)$ that border the minor $M$:

$$ \begin{vmatrix} a_{11} & \dots & a_{1r} & a_{1l} \\ \vdots & \ddots & \vdots & \vdots \\ a_{r1} & \dots & a_{rr} & a_{rl} \\ a_{k1} & \dots & a_{kr} & a_{kl} \end{vmatrix}, \quad k, l > r. \tag{2.72} $$

If all of them turn out to be equal to zero, then rk $A = r$.
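
The bordering procedure can be sketched as follows. The helper names `det` and `rank_by_bordering` are hypothetical, and the determinants of the bordering minors are computed here by exact Gaussian elimination, which is an implementation choice made for this sketch rather than something prescribed by the method.

```python
from fractions import Fraction


def det(m):
    """Determinant by Gaussian elimination over exact fractions."""
    m = [[Fraction(x) for x in row] for row in m]
    n, sign, result = len(m), 1, Fraction(1)
    for c in range(n):
        p = next((r for r in range(c, n) if m[r][c] != 0), None)
        if p is None:
            return Fraction(0)
        if p != c:
            m[c], m[p] = m[p], m[c]
            sign = -sign  # a row exchange changes the sign
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            m[r] = [a - f * b for a, b in zip(m[r], m[c])]
        result *= m[c][c]
    return sign * result


def rank_by_bordering(a):
    """Grow a non-zero minor by one row and one column until no border works."""
    rows, cols = [], []  # indices of the current non-zero minor
    while True:
        extended = False
        for i in (r for r in range(len(a)) if r not in rows):
            for j in (c for c in range(len(a[0])) if c not in cols):
                sub = [[a[r][c] for c in cols + [j]] for r in rows + [i]]
                if det(sub) != 0:
                    rows.append(i)
                    cols.append(j)
                    extended = True
                    break
            if extended:
                break
        if not extended:
            return len(rows)  # all bordering minors vanish


print(rank_by_bordering([[3, 5, 7], [1, 2, 3], [1, 3, 5]]))  # prints 2
```

The order in which rows and columns are appended may differ from their order in the matrix; this only affects the sign of a minor, not whether it is zero.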

The bordering minor method is especially convenient for problems involving functional matrices, or matrices whose elements depend on parameters (see, for example, Problem 2.60).

Review Questions

1. What is a determinant of the second order? Of the third order?

2. Formulate the rule of triangle for computing a determinant of the third order.

3. Define the concept of inversion.

4. Write a formula for the determinant of the n-th order.

5. How are the additional minor $M_{ij}$ of the element $a_{ij}$ and its cofactor $A_{ij}$ interconnected?

6. Formulate the Laplace theorem.

7. Enumerate the general properties of determinants.

8. What matrix transformations are elementary?

9. What is an inverse matrix?

10. What matrix is called degenerate?

11. What is the method of mathematical induction used for?

12. How are the functions of matrices computed?

13. Define the exponential and the logarithm of a matrix.

14. Which rows are linearly dependent?

15. Formulate the theorem of a basic minor.

16. What is the elementary transformations method based on?

17. Enumerate the methods of finding the matrix rank.


Problems 


2.1. Expand the definition (2.5) for the case n = 3. 
2.2. Find the number of inversions in the permutation (5, 1, 4, 3, 6, 8, 7, 2). 


2.3. How many inversions are there in the permutation (n,n — 1,...,2, 1)? 
2.4. Find the number of inversions in each permutation of a collection of 2n 
numbers: 


(1) $(1, 3, 5, 7, \dots, 2n - 1, 2, 4, 6, \dots, 2n)$;
(2) $(2, 4, 6, \dots, 2n, 1, 3, 5, \dots, 2n - 1)$.


2.5. With what sign does the summand $a_{n1} a_{n-1,2} \dots a_{2,n-1} a_{1n}$ appear in the
expression for the determinant (2.5)?
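2.6. Compute the third order determinants:

(1) $\begin{vmatrix} 3 & -2 & 1 \\ -2 & 1 & 3 \\ 2 & 0 & -2 \end{vmatrix}$; (2) $\begin{vmatrix} 1 & 2 & 0 \\ 0 & 1 & 3 \\ 5 & 0 & -1 \end{vmatrix}$; (3) $\begin{vmatrix} 2 & 0 & 5 \\ 1 & 3 & 16 \\ 0 & -1 & 10 \end{vmatrix}$;

(4) $\begin{vmatrix} 2 & -1 & 3 \\ -2 & 3 & 2 \\ 0 & 2 & 5 \end{vmatrix}$; (5) $\begin{vmatrix} 2 & 1 & 0 \\ 1 & 0 & 3 \\ 0 & 5 & -1 \end{vmatrix}$; (6) $\begin{vmatrix} 2 & 0 & 0 \\ 3 & 3 & 0 \\ 4 & 4 & 4 \end{vmatrix}$.

2.7. Compute the determinants of the fourth order:

(a) $\begin{vmatrix} -3 & 0 & 0 & 0 \\ 2 & 2 & 0 & 0 \\ 1 & 3 & -1 & 0 \\ -1 & 5 & 3 & 5 \end{vmatrix}$; (b) $\begin{vmatrix} 2 & -1 & 3 & 4 \\ 0 & -1 & 5 & -3 \\ 0 & 0 & 5 & -3 \\ 0 & 0 & 0 & 2 \end{vmatrix}$; (c) $\begin{vmatrix} 2 & -1 & 1 & 0 \\ 0 & 1 & 2 & -1 \\ 3 & -1 & 2 & 3 \\ 3 & 1 & 6 & 1 \end{vmatrix}$.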


2.8. Solve the following equation for the variable $x$:

$$ \begin{vmatrix} x & 1 & 1 & 1 \\ 1 & x & 1 & 1 \\ 1 & 1 & x & 1 \\ 1 & 1 & 1 & x \end{vmatrix} = 0. $$

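2.9. Compute the determinant

$$ \begin{vmatrix} 1 & 1 & 1 & \dots & 1 & 1 \\ 1 & 2 & 1 & \dots & 1 & 1 \\ 1 & 1 & 3 & \dots & 1 & 1 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 1 & 1 & 1 & \dots & p & 1 \\ 1 & 1 & 1 & \dots & 1 & p + 1 \end{vmatrix}, $$

where $p$ is a natural number.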
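*2.10. Compute the determinant of the matrix $Q$ of size $n \times n$, whose elements are
equal to

(a) $q_{ij} = \delta_{i,j+1} + \delta_{i+1,j}$,
(b) $q_{ij} = \delta_{i,j+1} - \delta_{i+1,j}$

for all $1 \le i, j \le n$. Here $\delta_{k_1, k_2}$ is the Kronecker symbol (see page 4).

2.11. The Fibonacci² sequence
$F_n = 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, \dots$ for $n = 1, 2, \dots$
is determined by the recurrence relation $F_{n+2} = F_n + F_{n+1}$ with the initial
conditions $F_1 = F_2 = 1$ [28, 70].

Prove that the $(n + 1)$-th term of the Fibonacci sequence is equal to the
determinant of the n-th order

$$ \begin{vmatrix} 1 & 1 & 0 & \dots & 0 & 0 \\ -1 & 1 & 1 & \dots & 0 & 0 \\ 0 & -1 & 1 & \dots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \dots & 1 & 1 \\ 0 & 0 & 0 & \dots & -1 & 1 \end{vmatrix}. \tag{2.73} $$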


*2.12. The Vandermonde³ determinant is the determinant of size $n \times n$ composed
of the real numbers $a_1, a_2, \dots, a_n$:

$$ V_n = \begin{vmatrix} 1 & 1 & 1 & \dots & 1 \\ a_1 & a_2 & a_3 & \dots & a_n \\ a_1^2 & a_2^2 & a_3^2 & \dots & a_n^2 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ a_1^{n-1} & a_2^{n-1} & a_3^{n-1} & \dots & a_n^{n-1} \end{vmatrix}. \tag{2.74} $$

(1) Verify that for $n = 2$ this determinant is equal to $V_2 = a_2 - a_1$.

(2) By the mathematical induction method, prove that the Vandermonde
determinant $V_n$ is equal to the product of all the possible differences
$a_j - a_i$ for $1 \le i < j \le n$:

$$ V_n = \prod_{i < j} (a_j - a_i). \tag{2.75} $$

² Fibonacci is the name under which the medieval mathematician Leonardo Pisano (about 1170 to about 1250) is known.

³ Alexandre-Théophile Vandermonde (1735-1796), French mathematician and musician.

2.13. Let $A$ be a matrix of size $n \times n$ and $m$ an arbitrary natural number. Is it true
that the equality $\det(mA) = m^n \det A$ holds?

2.14. At the examination in linear algebra, the student says that the determinant
of the sum of two matrices is always equal to the sum of the determinants of
these matrices: $\det(A + B) = \det A + \det B$. Is the student right?

2.15. Is the determinant

$$ \begin{vmatrix} 1 & 1 & 1 & 1 & 10000 \\ 10000 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 10000 & 1 \\ 1 & 10000 & 1 & 1 & 1 \\ 1 & 1 & 10000 & 1 & 1 \end{vmatrix} \tag{2.76} $$

a positive number, a negative number or zero?

2.16. Is the determinant

$$ \begin{vmatrix} 1 & 10000 & 3 & 4 & 5 \\ 5 & 1 & 2 & 3 & 10000 \\ 4 & 5 & 1 & 10000 & 3 \\ 3 & 4 & 10000 & 1 & 2 \\ 10000 & 3 & 4 & 3 & 1 \end{vmatrix} \tag{2.77} $$

a positive number, a negative number or zero?

2.17. Using Python, compute the determinants in Problems 2.15 and 2.16 and
verify the correctness of the solutions of these problems.

2.18. Find the number of minors of the k-th order in a matrix consisting of $m$ rows
and $n$ columns.

2.19. How many minors of the k-th order in a matrix of size $n \times n$ do not contain
diagonal elements of the original matrix?

*2.20. What is the greatest value that the determinant of a matrix of size $3 \times 3$,
consisting of the elements $+1$ and $-1$, can take?

*2.21. What is the greatest value that the determinant of a matrix of size $3 \times 3$,
consisting of the elements 0 and 1, can take?

2.22. Will the determinant of the matrix change if its columns are permuted in the
reverse order?
2.23. Computing the determinant of the matrix using Python

The elements of the integer square matrix $A$ are written row by row in the text
file input.txt. Using recursion, compute the determinant
$\det A$. Write the result into the text file output.txt.

2.24. Estimate the number of multiplications executed by the recursive algorithm
for computing the determinant of a matrix of size $n \times n$ (see the previous
problem).

*2.25. Prove that the exact number of multiplications executed by the recursive
algorithm for computing the determinant of a matrix of size $n \times n$ (see
Problems 2.23 and 2.24) is equal to

$$ T(n) = e\,n\,\Gamma(n, 1) - n!, $$

where $e$ is the base of natural logarithms and $\Gamma(n, x) = \int_x^{\infty} e^{-t} t^{n-1}\,dt$ is the
incomplete gamma function (for natural $n \in \mathbb{N}$ the following equality holds:
$\Gamma(n, x) = (n - 1)!\,e^{-x} \sum_{k=0}^{n-1} \frac{x^k}{k!}$).

*2.26. Obtain the asymptotic estimate of the mean value of the number of
inversions $A(N)$ in an array consisting of $N$ elements, for $N \to \infty$.

*2.27. Prove the validity of the identity

$$ \det(I + \varepsilon B) = 1 + \varepsilon \operatorname{tr} B + O(\varepsilon^2) \quad \text{for } \varepsilon \to 0, \tag{2.78} $$

where $B$ is some square matrix and $I$ is the identity matrix of the same size.

*2.28. Obtain the asymptotic estimate of the variable

$$ \det\left( \frac{I}{(I + \varepsilon B)^p} \right) \quad \text{for } \varepsilon \to 0, \tag{2.79} $$

if $B$ is an arbitrary square matrix, $I$ is the identity matrix of the same size,
and $p \in \mathbb{N}$.

*2.29. Assume that $M = G + \varepsilon H$, where $\varepsilon$ is a real number and $G$ is an invertible
matrix. Relying upon the equality $\det A = \exp(\operatorname{tr} \ln A)$ (see Theorem 2.5
on page 59), prove that

$$ \det M = \det G \left( 1 + \varepsilon \operatorname{tr}(G^{-1}H) + \tfrac{1}{2} \varepsilon^2 \left( \operatorname{tr}^2(G^{-1}H) - \operatorname{tr}(G^{-1}H)^2 \right) + O(\varepsilon^3) \right) \tag{2.80} $$

for $\varepsilon \to 0$.

2.30. Assume that the real parameters $a$, $b$, $c$ and $d$ are selected such that the
inequality $ad - bc \neq 0$ is valid. Compute the inverse of the matrix $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$.

2.31. Find the inverse matrices of the following matrices:


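(1) $\begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}$; (2) $\begin{pmatrix} 3 & 4 \\ 5 & 7 \end{pmatrix}$;

(3) $\begin{pmatrix} 3 & -4 & 5 \\ 2 & -3 & 1 \\ 3 & -5 & -1 \end{pmatrix}$; (4) $\begin{pmatrix} 2 & 7 & 3 \\ 3 & 9 & 4 \\ 1 & 5 & 3 \end{pmatrix}$; (5) $\begin{pmatrix} 1 & 2 & 2 \\ 2 & 1 & -2 \\ 2 & -2 & 1 \end{pmatrix}$.


2.32. Find the inverse matrices of the following matrices:

(1) $\begin{pmatrix} 1 & 1 \\ 0 & 2 \end{pmatrix}$; (2) $\begin{pmatrix} 1 & 1 & 1 \\ 0 & 2 & 2 \\ 0 & 0 & 3 \end{pmatrix}$; (3) $\begin{pmatrix} 1 & 1 & 1 & 1 \\ 0 & 2 & 2 & 2 \\ 0 & 0 & 3 & 3 \\ 0 & 0 & 0 & 4 \end{pmatrix}$.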


2.33. Calculate the commutators [A, A~'] and [A, A7~!], if A is an arbitrary 
nonsingular matrix. 
2.34. For what values of the real parameter $\lambda$ does the matrix have no inverse?
—-1liAA 
60 a ie cae te 
6 42 
3 rA 2X 
(2); A A -1 
-1-1A, 


2.35. Compute the matrix inverse of $A$:

$$ A = \begin{pmatrix} 1 & \alpha & 0 & 0 \\ 0 & 1 & \beta & 0 \\ 0 & 0 & 1 & \gamma \\ 0 & 0 & 0 & 1 \end{pmatrix}, $$

where $\alpha, \beta, \gamma \in \mathbb{R}$. Does the matrix $A^{-1}$ exist for all possible values of the
parameters?

*2.36. Compute the matrix inverse of $G_n$ of size $n \times n$, where $n > 2$:


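$$ G_n = \begin{pmatrix} 0 & 0 & 0 & \dots & 0 & 1 \\ 0 & 0 & 0 & \dots & 1 & 0 \\ \vdots & \vdots & \vdots & & \vdots & \vdots \\ 0 & 1 & 0 & \dots & 0 & 0 \\ 1 & 0 & 0 & \dots & 0 & 0 \end{pmatrix}. \tag{2.81} $$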


2.37. The elements of a Hilbert⁴ matrix $H = (h_{ij})$ are set by the rule
$h_{ij} = \dfrac{1}{i + j - 1}$, where $i, j = 1, 2, \dots, n$.
With the help of Python, compute $H^{-1}$, $H \cdot H^{-1}$ and $H^{-1} \cdot H$ for $n = 6, 7, 8$.


*2.38. Find the inverse of the following matrix: 


where A is some real number. 
2.39. With the help of the mathematical induction method, prove the matrix
product inversion formula

$$ (A_1 A_2 \dots A_{n-1} A_n)^{-1} = A_n^{-1} A_{n-1}^{-1} \dots A_2^{-1} A_1^{-1}. \tag{2.82} $$


2.40. Solve the matrix equations: 


yal 64 
(1) ‘X= ; 

02 21 

= a 21. i 
2) | 0 22|-x=|-1 0 2]; 

0 23 it 


⁴ David Hilbert (1862-1943), German mathematician.

f 163 Of = 2
(3) X-|}-3 6 2 | =|-1-2-2];
2-6-3 0 1 10
23 ees ini
(4) a =
=82 54 01

2.41. Assuming that $a$ is an arbitrary real number, raise the matrix to the power:

$$ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & a & 1 \end{pmatrix}^{256}. \tag{2.83} $$

*2.42. Raise the matrix to the power:

et0|- 5 (2.84)

where the constants $g, h \in \mathbb{R}$.

2.43. Using the mathematical induction method, prove that the n-th power of the
matrix $F = \begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix}$ has the form

$$ F^n = \begin{pmatrix} F_{n-1} & F_n \\ F_n & F_{n+1} \end{pmatrix} \tag{2.85} $$

for all natural values $n > 1$, where $F_n$ are the Fibonacci numbers (see the
definition in Problem 2.11 on page 67).

2.44. Compute the n-th power of the upper triangular matrix $A = \begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix}$,
where $\alpha, \beta, \gamma \in \mathbb{R}$.

2.45. Compute the $q$-th power of the functional matrix


$$ U(\varphi) = \begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix} $$

for all integer values of $q \in \mathbb{Z}$.

2.46. It is known about the matrices $A$ and $B$ that their commutator is the identity
matrix: $[A, B] = I$. Compute $[A, B^q]$ for all integer values of the parameter
$q \in \mathbb{Z}$.

2.47. Compute the value of the function f(x) = x* — 3x + 2, if as the argument 
is taken the matrix A, where 


10 
(1) A= ; 
01 

=) 

(2) A= 
23 


2.48. Compute the value of the function g(x) = x* + x — 3, if as the argument is 
taken the matrix A, where 


100 
(1) A=] 1104); 
111 
—2 10 
(2) A=| 3 -11 
2 10 


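2.49. Compute the value of the fractionally rational function $g(A)$, if
$g(x) = \dfrac{x^3 + x^2 - 3x - 5}{x^3 - 5x - 2}$ and

(1) $A = \begin{pmatrix} 2 & 0 & 0 \\ 0 & -4 & 0 \\ 0 & 0 & 1 \end{pmatrix}$;

(2) $A = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 0 \\ 0 & 1 & 1 \end{pmatrix}$.

2.50. Show that the equality

$$ \exp \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} \cos 1 & -\sin 1 \\ \sin 1 & \cos 1 \end{pmatrix} $$

is valid.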


2.51. Find $e^A$ for


(1) A=]010 


100 


000 


*2.52. Find $e^A$ for


(1) A=]010


010
(2) A=|/001}.


101


(2) A=]111


001


2.53. For the matrices specified in the previous problem, compute $\ln A$.

*2.54. Prove that for commuting matrices the exponential of the sum is equal
to the product of the exponentials of the summands:

$$ \exp(A + B) = \exp(A) \exp(B). $$

2.55. Prove that $\exp(tA) = I + tA + O(t^2)$ for $t \to 0$.

2.56. Consider all possible matrices of size $3 \times 3$ that contain no zero column.
Enumerate the pairwise different echelon forms of such matrices.

2.57. Which elementary operations with the elements of an echelon matrix
preserve its echelon form?

2.58. Find the ranks of the matrices:
2-43 10
ie See: ae ee) 
ey ma Ne Sy 
() A=]-1 3 -6-2 4 |; Q) A= 
01-41-34 
A Bw 
4-7 4 -45 
Oo aa 
Lo 1,3 
af Sd 5 
421256 
(3) A= >; 4 A=] 3 1 7 4; 
ae ee te) 
0-510 
21-10 
23 0 
357 4322 
(5) A=]123}]; ©) A=]0211)]; 
135 0033 
i 3-331 
i- sivas 
FSLe3A 
(7) A=] —2 1 526]; (8) A= 
ci ees Oe 
21472 
TT OF 


2.59. With the help of the bordering minor method, find the maximum value that 
the rank of the matrix A can take 


Bay Do ye Se 
A=|]y-a-y 0 @ |}, 
a—-p B -a 0 


if $\alpha$, $\beta$ and $\gamma$ are real constants.


*2.60. Find the ranks of the following matrices for all possible values of the real 
parameter A: 


120 


(a) }110]; 
001 




(b) 


(c) 


(d) 


(e) 


(f) 


(g) 


(h) 


(i) 


1 


1 


0 O 2+A 
1+A2 0 4 


1 0 O 


0 2 1

Answers and Solutions


2.1 Solution.
For $n = 3$ we have $3! = 1 \cdot 2 \cdot 3 = 6$ permutations:
$k_1 = 1, k_2 = 2, k_3 = 3$; $k_1 = 1, k_2 = 3, k_3 = 2$; $k_1 = 2, k_2 = 1, k_3 = 3$;
$k_1 = 2, k_2 = 3, k_3 = 1$; $k_1 = 3, k_2 = 1, k_3 = 2$; $k_1 = 3, k_2 = 2, k_3 = 1$.

Therefore, the sum in $\det A$ consists of 6 terms, half of which are taken with a
positive sign and the other half with a negative sign:

$$ \det A = \sum_{\text{perm}} (-1)^{\sigma} a_{1k_1} a_{2k_2} a_{3k_3} $$
$$ = (-1)^0 a_{11}a_{22}a_{33} + (-1)^1 a_{11}a_{23}a_{32} + (-1)^1 a_{12}a_{21}a_{33} + (-1)^2 a_{12}a_{23}a_{31} + (-1)^2 a_{13}a_{21}a_{32} + (-1)^3 a_{13}a_{22}a_{31} $$
$$ = a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32} - a_{13}a_{22}a_{31} - a_{12}a_{21}a_{33} - a_{11}a_{23}a_{32}. $$
The obtained formula, of course, conforms with the definition (2.3). 
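
The sum over permutations can also be evaluated mechanically. The following is a small sketch, with the hypothetical names `inversions` and `det_by_definition`, that applies the definition directly; it needs $n!$ terms and is therefore only practical for small $n$.

```python
from itertools import permutations


def inversions(p):
    """Number of pairs (i, j) with i < j and p[i] > p[j]."""
    return sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])


def det_by_definition(a):
    """Determinant as the signed sum over all permutations of column indices."""
    n = len(a)
    total = 0
    for p in permutations(range(n)):
        term = (-1) ** inversions(p)
        for row, col in enumerate(p):
            term *= a[row][col]
        total += term
    return total


# The matrix of Problem 2.6 (1).
print(det_by_definition([[3, -2, 1], [-2, 1, 3], [2, 0, -2]]))  # prints -12
```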


2.2 Answer: the number of inversions is equal to 10. 


2.3 Solution.
The first element of the permutation, equal to $n$, forms $n - 1$ inversions paired
with each of the elements $n - 1, n - 2, \dots, 1$. The second element of the permutation,
$n - 1$, forms $n - 2$ inversions with the remaining elements. As is easy to see,
the permutation element equal to $k$, where $1 \le k \le n$, forms $k - 1$ inversions with the elements that follow it.

Therefore, the total number of inversions in the permutation $(n, n - 1, \dots, 2, 1)$ is
equal to the sum $\sum_{k=1}^{n} (k - 1) = n(n - 1)/2$.
2.4 Answer:
(1) $\dfrac{n(n - 1)}{2}$;
(2) $\dfrac{n(n + 1)}{2}$.
2.5 Solution.

The summand $a_{n1} a_{n-1,2} \dots a_{2,n-1} a_{1n}$ in the formula (2.5) is assigned the sign
$(-1)^{\sigma} = (-1)^{n(n-1)/2}$, since the number of inversions in the permutation
$(n, n - 1, \dots, 2, 1)$ is equal to $\sigma = n(n - 1)/2$ (see Problem 2.3).

Note. With the help of the function "floor" $\lfloor x \rfloor$, the greatest integer that is
less than or equal to the argument $x$, i.e. $\lfloor x \rfloor = \max\{n \in \mathbb{Z} : n \le x\}$, the answer
can be written in a more compact form: $(-1)^{\sigma} = (-1)^{\lfloor n/2 \rfloor}$.


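2.6 Solution.
(1) Calculate the determinant using the rule of triangle, by the formula (2.3):

$$ \begin{vmatrix} 3 & -2 & 1 \\ -2 & 1 & 3 \\ 2 & 0 & -2 \end{vmatrix} = 3 \cdot 1 \cdot (-2) + (-2) \cdot 3 \cdot 2 + 1 \cdot (-2) \cdot 0 - 1 \cdot 1 \cdot 2 - (-2) \cdot (-2) \cdot (-2) - 3 \cdot 3 \cdot 0 = -12. $$

(2) Expand the determinant in the first row:

$$ \begin{vmatrix} 1 & 2 & 0 \\ 0 & 1 & 3 \\ 5 & 0 & -1 \end{vmatrix} = 1 \cdot \begin{vmatrix} 1 & 3 \\ 0 & -1 \end{vmatrix} - 2 \cdot \begin{vmatrix} 0 & 3 \\ 5 & -1 \end{vmatrix} + 0 \cdot \begin{vmatrix} 0 & 1 \\ 5 & 0 \end{vmatrix} $$
$$ = 1 \cdot (1 \cdot (-1) - 0 \cdot 3) - 2 \cdot (0 \cdot (-1) - 3 \cdot 5) + 0 \cdot (0 \cdot 0 - 5 \cdot 1) = -1 + 30 = 29. $$

(3) Expand the determinant in the first row:

$$ \begin{vmatrix} 2 & 0 & 5 \\ 1 & 3 & 16 \\ 0 & -1 & 10 \end{vmatrix} = 2 \cdot (3 \cdot 10 - (-1) \cdot 16) - 0 \cdot (1 \cdot 10 - 0 \cdot 16) + 5 \cdot (1 \cdot (-1) - 0 \cdot 3) = 92 - 5 = 87. $$

(4) Expand the determinant, for example, in the third row:

$$ \begin{vmatrix} 2 & -1 & 3 \\ -2 & 3 & 2 \\ 0 & 2 & 5 \end{vmatrix} = 0 \cdot ((-1) \cdot 2 - 3 \cdot 3) - 2 \cdot (2 \cdot 2 - (-2) \cdot 3) + 5 \cdot (2 \cdot 3 - (-2) \cdot (-1)) = -20 + 20 = 0. $$

(5) Expand the determinant in the first row:

$$ \begin{vmatrix} 2 & 1 & 0 \\ 1 & 0 & 3 \\ 0 & 5 & -1 \end{vmatrix} = 2 \cdot (0 \cdot (-1) - 3 \cdot 5) - 1 \cdot (1 \cdot (-1) - 3 \cdot 0) + 0 \cdot (1 \cdot 5 - 0 \cdot 0) = -30 + 1 = -29. $$

(6) The determinant of a lower triangular matrix is calculated as the product of
its diagonal elements:

$$ \begin{vmatrix} 2 & 0 & 0 \\ 3 & 3 & 0 \\ 4 & 4 & 4 \end{vmatrix} = 2 \cdot 3 \cdot 4 = 24. $$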

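2.7 Solution.

(a) Expand the determinant in the first row:

$$ \begin{vmatrix} -3 & 0 & 0 & 0 \\ 2 & 2 & 0 & 0 \\ 1 & 3 & -1 & 0 \\ -1 & 5 & 3 & 5 \end{vmatrix} = -3 \cdot \begin{vmatrix} 2 & 0 & 0 \\ 3 & -1 & 0 \\ 5 & 3 & 5 \end{vmatrix} - 0 \cdot \begin{vmatrix} 2 & 0 & 0 \\ 1 & -1 & 0 \\ -1 & 3 & 5 \end{vmatrix} + 0 \cdot \begin{vmatrix} 2 & 2 & 0 \\ 1 & 3 & 0 \\ -1 & 5 & 5 \end{vmatrix} - 0 \cdot \begin{vmatrix} 2 & 2 & 0 \\ 1 & 3 & -1 \\ -1 & 5 & 3 \end{vmatrix} $$
$$ = -3 \cdot (-10 + 0 + 0 - 0 - 0 - 0) - 0 + 0 - 0 = 30. $$

Note that the original matrix is lower triangular, hence its determinant
can be calculated by a simpler method, as the product of the diagonal elements:
$\Delta = (-3) \cdot 2 \cdot (-1) \cdot 5 = 30$.

(b) Expand the determinant in the first column:

$$ \begin{vmatrix} 2 & -1 & 3 & 4 \\ 0 & -1 & 5 & -3 \\ 0 & 0 & 5 & -3 \\ 0 & 0 & 0 & 2 \end{vmatrix} = 2 \cdot \begin{vmatrix} -1 & 5 & -3 \\ 0 & 5 & -3 \\ 0 & 0 & 2 \end{vmatrix} - 0 \cdot \begin{vmatrix} -1 & 3 & 4 \\ 0 & 5 & -3 \\ 0 & 0 & 2 \end{vmatrix} + 0 \cdot \begin{vmatrix} -1 & 3 & 4 \\ -1 & 5 & -3 \\ 0 & 0 & 2 \end{vmatrix} - 0 \cdot \begin{vmatrix} -1 & 3 & 4 \\ -1 & 5 & -3 \\ 0 & 5 & -3 \end{vmatrix} $$
$$ = 2 \cdot (-10 + 0 + 0 - 0 - 0 - 0) - 0 + 0 - 0 = -20. $$

The result can be obtained faster if we note that the matrix is upper
triangular. Then $\Delta = 2 \cdot (-1) \cdot 5 \cdot 2 = -20$.

(c) Expand the determinant in the first row:

$$ \begin{vmatrix} 2 & -1 & 1 & 0 \\ 0 & 1 & 2 & -1 \\ 3 & -1 & 2 & 3 \\ 3 & 1 & 6 & 1 \end{vmatrix} = 2 \cdot \begin{vmatrix} 1 & 2 & -1 \\ -1 & 2 & 3 \\ 1 & 6 & 1 \end{vmatrix} - (-1) \cdot \begin{vmatrix} 0 & 2 & -1 \\ 3 & 2 & 3 \\ 3 & 6 & 1 \end{vmatrix} + 1 \cdot \begin{vmatrix} 0 & 1 & -1 \\ 3 & -1 & 3 \\ 3 & 1 & 1 \end{vmatrix} - 0 \cdot \begin{vmatrix} 0 & 1 & 2 \\ 3 & -1 & 2 \\ 3 & 1 & 6 \end{vmatrix} $$
$$ = 2 \cdot (2 + 6 + 6 + 2 - 18 + 2) - (-1) \cdot (0 + 18 - 18 + 6 - 0 - 6) + 1 \cdot (0 + 9 - 3 - 3 - 0 - 3) - 0 = 0. $$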
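2.8 Solution.

Denote the determinant by $\Delta$ and expand it in the first row:

$$ \Delta = x \begin{vmatrix} x & 1 & 1 \\ 1 & x & 1 \\ 1 & 1 & x \end{vmatrix} - \begin{vmatrix} 1 & 1 & 1 \\ 1 & x & 1 \\ 1 & 1 & x \end{vmatrix} + \begin{vmatrix} 1 & x & 1 \\ 1 & 1 & 1 \\ 1 & 1 & x \end{vmatrix} - \begin{vmatrix} 1 & x & 1 \\ 1 & 1 & x \\ 1 & 1 & 1 \end{vmatrix}. $$

Expanding each of the obtained third order determinants in the first row, or in the row consisting of ones, if
any, we find that the first of them is equal to $x^3 - 3x + 2$, the second and the fourth are equal to $(x - 1)^2$, and the third is equal to $-(x - 1)^2$. Therefore,

$$ \Delta = x(x^3 - 3x + 2) - 3(x^2 - 2x + 1) = x(x - 1)(x^2 + x - 2) - 3(x - 1)^2 $$
$$ = (x - 1)(x^3 + x^2 - 5x + 3) = (x - 1)^3 (x + 3). $$

As a result, the roots of the equation $\Delta = 0$ are equal to $x = 1$ and $x = -3$.

2.9 Solution.

Apply to the determinant the following equivalent transformations: from each
row, starting with the second one, subtract the first one. As a result we obtain the
determinant of an upper triangular matrix:

$$ \begin{vmatrix} 1 & 1 & 1 & \dots & 1 & 1 \\ 0 & 1 & 0 & \dots & 0 & 0 \\ 0 & 0 & 2 & \dots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \dots & p - 1 & 0 \\ 0 & 0 & 0 & \dots & 0 & p \end{vmatrix}. $$

Such a determinant is equal to the product of the diagonal elements of the matrix:
$1 \cdot 1 \cdot 2 \cdots (p - 1) \cdot p = p!$.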
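2.10 Solution.

(a) Denote the determinant of the matrix $Q$ by $Q_n$ and write it in an explicit form:

$$ Q_n = \begin{vmatrix} 0 & 1 & 0 & 0 & \dots & 0 & 0 \\ 1 & 0 & 1 & 0 & \dots & 0 & 0 \\ 0 & 1 & 0 & 1 & \dots & 0 & 0 \\ \vdots & \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & 0 & \dots & 0 & 1 \\ 0 & 0 & 0 & 0 & \dots & 1 & 0 \end{vmatrix}. $$

Let us use the expansion in the first row, following which expand the obtained
determinant of order $(n - 1)$ in the first column:

$$ Q_n = (-1)^{1+2} \begin{vmatrix} 1 & 1 & 0 & \dots & 0 & 0 \\ 0 & 0 & 1 & \dots & 0 & 0 \\ 0 & 1 & 0 & \dots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \dots & 0 & 1 \\ 0 & 0 & 0 & \dots & 1 & 0 \end{vmatrix} = (-1)^{1+2} (-1)^{1+1} \begin{vmatrix} 0 & 1 & \dots & 0 & 0 \\ 1 & 0 & \dots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \dots & 0 & 1 \\ 0 & 0 & \dots & 1 & 0 \end{vmatrix} = -Q_{n-2}. $$

Note that the problem is reduced to computing the variable $Q_{n-2}$. For the
smallest possible values of the order of the matrix, $n = 1$ and $n = 2$, we have

$$ Q_1 = \det[0] = 0, \qquad Q_2 = \begin{vmatrix} 0 & 1 \\ 1 & 0 \end{vmatrix} = -1. $$

Thus, the recurrence relation is obtained:

$$ Q_n = -Q_{n-2}, \qquad Q_1 = 0, \quad Q_2 = -1. $$

By the mathematical induction method we can show that its solution $Q_n$
has the form:

$$ Q_n = \begin{cases} (-1)^{n/2}, & \text{if } n \text{ is even}, \\ 0, & \text{if } n \text{ is odd}. \end{cases} $$

(b) Using Laplace's method, similarly to item (a) we obtain the recurrence relation:

$$ Q_n = Q_{n-2}, \qquad Q_1 = 0, \quad Q_2 = 1. $$

Its solution, as is easy to show with the help of the mathematical induction
method, has the form:

$$ Q_n = \frac{1}{2} \left( 1 + (-1)^n \right). $$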

2.11 Solution.

Denote by $P(n)$ the predicate "$F_{n+1} = D(n)$", where $D(n)$ is the determinant (2.73).
Let us use the mathematical induction method.

Basis step

For $n = 1$ and $n = 2$ we have

$$ F_2 = |1| = 1 \ \text{(true)}, \qquad F_3 = \begin{vmatrix} 1 & 1 \\ -1 & 1 \end{vmatrix} = 2 \ \text{(true)}. $$

Inductive step
Assume that the statements $P(k)$ are true for $k = 1, 2, \dots, n$.
Prove that this entails that $P(n + 1)$ is true.

Indeed, expanding the determinant $D(n + 1)$ in the first row, we obtain
$D(n + 1) = D(n) + D(n - 1)$.

According to the inductive supposition, $D(n) = F_{n+1}$ and $D(n - 1) = F_n$.

Therefore, $D(n + 1) = F_{n+1} + F_n = F_{n+2}$, which is the statement $P(n + 1)$.

Thus, the mathematical induction method has proved that $F_{n+1} = D(n)$ for all
natural $n$.


2.12 Solution.
(1) According to the definition (2.74), for $n = 2$ we have

$$ V_2 = \begin{vmatrix} 1 & 1 \\ a_1 & a_2 \end{vmatrix} = a_2 - a_1. $$

(2) Let us introduce for consideration the predicate "$V_n = \prod_{1 \le i < j \le n} (a_j - a_i)$" and
denote it by $P(n)$.

Basis step
The case of the least $n = 2$ is proved in item (1) of this problem.

Inductive step
Let the equality $V_k = \prod_{1 \le i < j \le k} (a_j - a_i)$ be fulfilled for some natural $k \ge 2$.
Prove that $V_{k+1} = \prod_{1 \le i < j \le k+1} (a_j - a_i)$.

Transform $V_{k+1}$ as follows: from the $(k + 1)$-th row subtract the $k$-th one
multiplied by $a_1$, then from the $k$-th one subtract the $(k - 1)$-th one, also multiplied
by $a_1$, and so on up to and including the second row:

$$ V_{k+1} = \begin{vmatrix} 1 & 1 & 1 & \dots & 1 \\ 0 & a_2 - a_1 & a_3 - a_1 & \dots & a_{k+1} - a_1 \\ 0 & a_2^2 - a_1 a_2 & a_3^2 - a_1 a_3 & \dots & a_{k+1}^2 - a_1 a_{k+1} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & a_2^k - a_1 a_2^{k-1} & a_3^k - a_1 a_3^{k-1} & \dots & a_{k+1}^k - a_1 a_{k+1}^{k-1} \end{vmatrix}. $$

The first column of the obtained determinant is formed by zeroes, except the
element in the upper left corner, which is equal to one. Using this fact, it is easy to perform
the expansion in the first column. After taking the common multipliers outside the
sign of the determinant, we have

$$ V_{k+1} = (a_2 - a_1)(a_3 - a_1) \dots (a_{k+1} - a_1) \begin{vmatrix} 1 & 1 & \dots & 1 \\ a_2 & a_3 & \dots & a_{k+1} \\ a_2^2 & a_3^2 & \dots & a_{k+1}^2 \\ \vdots & \vdots & \ddots & \vdots \\ a_2^{k-1} & a_3^{k-1} & \dots & a_{k+1}^{k-1} \end{vmatrix}. $$

As a result we have obtained a determinant of the form $V_k$, composed of the numbers $a_2, \dots, a_{k+1}$, which, according to the
inductive supposition, is equal to the product of all the possible differences $a_j - a_i$
for $2 \le i < j \le k + 1$. Together with the multipliers $(a_j - a_1)$ this gives the product over all $1 \le i < j \le k + 1$.

Therefore, the mathematical induction method has proved the formula (2.75).

Note. The Vandermonde matrix, i.e. a matrix of the form

$$ \begin{pmatrix} 1 & 1 & \dots & 1 \\ a_1 & a_2 & \dots & a_n \\ a_1^2 & a_2^2 & \dots & a_n^2 \\ \vdots & \vdots & \ddots & \vdots \\ a_1^{n-1} & a_2^{n-1} & \dots & a_n^{n-1} \end{pmatrix}, $$

is widely used in the theory of approximation of functions by polynomials [11, 58].
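
The formula (2.75) is easy to spot-check numerically. The sketch below uses hypothetical helper names and compares the product of differences with a determinant computed straight from the permutation definition; it is a sanity check for small $n$, not a proof.

```python
from itertools import combinations, permutations
from math import prod


def det_by_permutations(a):
    """Signed sum over permutations; the sign comes from the inversion count."""
    n = len(a)
    total = 0
    for p in permutations(range(n)):
        inv = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
        total += (-1) ** inv * prod(a[r][p[r]] for r in range(n))
    return total


def vandermonde_matrix(nodes):
    """Row i holds the i-th powers of the nodes, as in (2.74)."""
    return [[x ** i for x in nodes] for i in range(len(nodes))]


def vandermonde_product(nodes):
    """Product of (a_j - a_i) over all pairs i < j."""
    return prod(nodes[j] - nodes[i] for i, j in combinations(range(len(nodes)), 2))


nodes = [2, 3, 5, 7]
print(det_by_permutations(vandermonde_matrix(nodes)), vandermonde_product(nodes))
```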


2.13 Solution.

When any row of a matrix is multiplied by a real number, the determinant of this
matrix is multiplied by this number. Therefore, as all the elements of the matrix $A$,
i.e. all its $n$ rows, are multiplied by $m$, the determinant is multiplied by the value

$$ \underbrace{m \times m \times \dots \times m}_{n \text{ times}}. $$

Thus, for all $m \in \mathbb{N}$ the equality $\det(mA) = m^n \det A$ is fulfilled.


2.14 Solution.

The student is wrong. The determinant of the sum of two matrices is not always
equal to the sum of the determinants of these matrices, which is confirmed by the
following counterexample.

Consider the identity matrix $I$ of size $n \times n$, where $n \ge 2$. Then the inequality is valid:

$$ \det(I + I) \neq \det I + \det I, $$

since $\det(I + I) = \det(2I) = 2^n$ (see Problem 2.13). At the same time, $\det I + \det I = 1 + 1 = 2 \neq \det(I + I)$.

2.15 Solution.

The expansion of the determinant by the formula (2.5) contains $5! = 120$
summands. One of these summands, of the form $a_{15} a_{21} a_{34} a_{42} a_{53}$, includes all the
five multipliers of value 10000 and is equal to $(-1)^{\sigma} 10000^5 = (-1)^{\sigma} 10^{20}$. Here
$\sigma$ is the number of inversions in the permutation $(5, 1, 4, 2, 3)$. As is easy to see,
$\sigma = 6$.

The remaining $5! - 1 = 119$ summands include no more than three multipliers
10000 and, therefore, do not exceed $10000^3 \times 1 \times 1 = 10^{12}$ in absolute magnitude.
Hence it may be concluded that the considered determinant is no less than the
difference $10^{20} - 119 \cdot 10^{12} > 0$.

As a result, the determinant is positive.


2.16 Solution.
Relying on the reasoning provided in the solution of Problem 2.15, we obtain: the
determinant in this case does not exceed the value


(10 4 119 244510) <0, 


Therefore, this determinant is a negative number. 


2.17 Answer:
Computing the determinants with the help of Python provides the following
results: the determinant (2.76) is equal to

$$ 99999990001999850004, $$

and the determinant (2.77) is equal to

$$ -99999909053183731167. $$


These results, of course, conform with the solution of Problems 2.15 and 2.16. 
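
One possible way to reproduce the first value exactly is shown below. This is a sketch only: the source does not say which routine was used, and the fraction-free (Bareiss) elimination chosen here, with the hypothetical name `det_bareiss`, is an assumption made so that all intermediate values stay integers.

```python
def det_bareiss(a):
    """Exact integer determinant by fraction-free (Bareiss) elimination."""
    m = [row[:] for row in a]
    n = len(m)
    sign, prev = 1, 1
    for k in range(n - 1):
        if m[k][k] == 0:
            # Bring up a row with a non-zero entry in column k, if there is one.
            swap = next((r for r in range(k + 1, n) if m[r][k] != 0), None)
            if swap is None:
                return 0
            m[k], m[swap] = m[swap], m[k]
            sign = -sign
        for i in range(k + 1, n):
            for j in range(k + 1, n):
                # The division by the previous pivot is always exact.
                m[i][j] = (m[i][j] * m[k][k] - m[i][k] * m[k][j]) // prev
        prev = m[k][k]
    return sign * m[n - 1][n - 1]


# The matrix of the determinant (2.76) from Problem 2.15.
big = 10000
a_276 = [
    [1, 1, 1, 1, big],
    [big, 1, 1, 1, 1],
    [1, 1, 1, big, 1],
    [1, big, 1, 1, 1],
    [1, 1, big, 1, 1],
]
print(det_bareiss(a_276))
```

Python integers have arbitrary precision, so the 20-digit result is not rounded; a floating-point routine would lose the trailing digits.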


2.18 Solution.
Recall that in combinatorics the number of combinations of $k$ elements chosen from $n$ different elements
without repetitions is denoted as $C(n, k)$ [1, 60].
In order to form a minor of the k-th order, one should select $k$ rows and $k$
columns from the matrix. The rows can be selected in $C(m, k)$ ways, while the
columns can be selected in $C(n, k)$ ways. Applying the combinatorial rule of product, we obtain
that in total there are $C(m, k)\,C(n, k)$ minors of the k-th order.


2.19 Answer: 

The rows for formation of the minor may be selected using the number of 
combinations, by C(n,k) methods. Then the columns should be selected so that 
their numbers should not coincide with the numbers of the selected rows. This can 
be done in C(n — k, k) ways. In all, according to the combinatory rule of product, 
we obtain the answer: C(n, k)C(n — k, k) minors. 
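
Both counting formulas can be checked by brute force for small sizes. In the sketch below the function names are hypothetical; a minor is identified with its pair (set of rows, set of columns), and it contains a diagonal element exactly when the two sets share an index.

```python
from itertools import combinations
from math import comb


def count_minors(m, n, k):
    """Number of k-th order minors of an m x n matrix: C(m, k) * C(n, k)."""
    return comb(m, k) * comb(n, k)


def count_offdiagonal_minors(n, k):
    """Brute-force count of k-th order minors of an n x n matrix
    that contain no diagonal element a_ii."""
    return sum(
        1
        for rows in combinations(range(n), k)
        for cols in combinations(range(n), k)
        if not set(rows) & set(cols)
    )


for n in range(1, 6):
    for k in range(1, n + 1):
        print(n, k, count_offdiagonal_minors(n, k), comb(n, k) * comb(n - k, k))
```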


2.20 Solution.
Consider the matrix $A$ of size $3 \times 3$

$$ A = \begin{pmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{pmatrix} $$

and write its determinant in the form

$$ \det A = a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32} - a_{11}a_{23}a_{32} - a_{12}a_{21}a_{33} - a_{13}a_{22}a_{31} = \alpha + \beta + \gamma + \delta + \varepsilon + \zeta, $$

where the designations $\alpha = a_{11}a_{22}a_{33}$, $\beta = a_{12}a_{23}a_{31}$, ..., $\zeta = -a_{13}a_{22}a_{31}$ are
introduced. Each of the variables $\alpha, \beta, \dots, \zeta$ takes values from the set $\{-1, 1\}$.

All the six summands of the determinant cannot have the same sign. Indeed, the
product $\alpha\beta\gamma$ can be presented in the form of the product of the nine elements of the
matrix $A$:

$$ \alpha\beta\gamma = \prod_{i,j=1}^{3} a_{ij}. $$

At the same time, there exists the equality $\delta\varepsilon\zeta = (-1)^3 \prod_{i,j=1}^{3} a_{ij} = -\alpha\beta\gamma$.
Therefore, among $\alpha, \beta, \dots, \zeta$ there exist negative summands, and $\det A < 6$.
Since $\det A$ is a sum of six summands, each equal to $+1$ or $-1$, it is an even number. Hence $\det A \le 4$.

As is easy to find by direct calculation, the matrix

$$ A = \begin{pmatrix} -1 & 1 & 1 \\ 1 & -1 & 1 \\ 1 & 1 & -1 \end{pmatrix} $$

has $\det A = 4$.

Finally we obtain: the greatest value of the determinant of a matrix of size $3 \times 3$,
consisting of the elements $+1$ and $-1$, is equal to 4.


2.21 Answer: 2. 
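
The answers to Problems 2.20 and 2.21 can be confirmed by exhaustive search, since there are only $2^9 = 512$ matrices in each case. A minimal sketch with hypothetical function names:

```python
from itertools import product


def det3(m):
    """Determinant of a 3 x 3 matrix by expansion in the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def max_det(values):
    """Largest determinant over all 3 x 3 matrices with entries from `values`."""
    return max(
        det3((e[0:3], e[3:6], e[6:9])) for e in product(values, repeat=9)
    )


print(max_det((-1, 1)), max_det((0, 1)))  # prints 4 2
```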


2.22 Answer: the determinant will be multiplied by $(-1)^{\lfloor n/2 \rfloor}$.
2.23 Solution.


# The number of multiplications
count = 0


def get_determ(A):
    global count

    size = len(A)

    if size == 1:
        # Base case: a 1 x 1 matrix, no multiplications are needed
        return A[0][0]
    elif size == 2:
        count += 2
        return A[0][0] * A[1][1] - A[0][1] * A[1][0]
    else:
        det = 0

        # Expansion over the first row
        for col in range(size):
            # Delete the first row and the column with index col
            minor = [row[:col] + row[col + 1:] for row in A[1:]]
            det_sign = 1 if col % 2 == 0 else -1
            det += det_sign * A[0][col] * get_determ(minor)

            count += 1

        return det


mas = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]

print("det =", get_determ(mas))
print("count =", count)
2.24 Solution.

For a matrix of size $n \times n$, $n$ recursive calls are performed and $n$ multiplications
of the form $a_{1j} \times A_{1j}$, $j = 1, \dots, n$, are executed. The exit from the recursion occurs
at $n = 1$; no multiplications are executed in this case. Due to this, the total number
of multiplications satisfies the recurrence relation:

$$ T(n) = nT(n - 1) + n, \quad n > 1, \qquad T(1) = 0. $$

Solve the obtained relation by the method of substitution, i.e. successively
expressing $T(n - 1)$ by $T(n - 2)$, then $T(n - 2)$ by $T(n - 3)$, and so on:

$$ T(n) = nT(n - 1) + n $$
$$ = n[(n - 1)T(n - 2) + n - 1] + n $$
$$ = n(n - 1)T(n - 2) + n(n - 1) + n $$
$$ = n(n - 1)[(n - 2)T(n - 3) + n - 2] + n(n - 1) + n $$
$$ = n(n - 1)(n - 2)T(n - 3) + n(n - 1)(n - 2) + n(n - 1) + n. $$

Similarly continuing this process until $T(1) = 0$, we obtain

$$ T(n) = n(n - 1)(n - 2) \dots 2 \cdot T(1) + n(n - 1)(n - 2) \dots 2 + n(n - 1)(n - 2) \dots 3 + \dots + n(n - 1) + n $$
$$ = \frac{n!}{1!} + \frac{n!}{2!} + \dots + \frac{n!}{(n - 2)!} + \frac{n!}{(n - 1)!} = n! \sum_{k=1}^{n-1} \frac{1}{k!}. $$

It is possible to write an analytical expression for $T(n)$ in terms of non-elementary
functions; however, for the solution of the posed problem it is enough to evaluate the
asymptotic behaviour of the function $T(n)$.

Note that for $n \to \infty$

$$ \lim_{n \to \infty} \sum_{k=1}^{n} \frac{1}{k!} = \sum_{k=1}^{\infty} \frac{1}{k!} = \sum_{k=0}^{\infty} \frac{1}{k!} - 1 = e - 1, $$

where $e = 2.71828\dots$ is the base of natural logarithms. Hence, for $n \ge 2$, we obtain the
inequality

$$ n! \le n! \sum_{k=1}^{n-1} \frac{1}{k!} < (e - 1)\,n!, $$

and, finally, $T(n) = O(n!)$.

2.25 Solution.
According to the result of the previous problem, $T(n) = n! \sum_{k=1}^{n-1} \frac{1}{k!}$.

Let us expand the expression $e\,n\,\Gamma(n, 1) - n!$, taking into account the definition
of the incomplete gamma function:

$$ e\,n\,\Gamma(n, 1) - n! = e\,n \cdot (n - 1)!\,e^{-1} \sum_{k=0}^{n-1} \frac{1}{k!} - n! = n! \sum_{k=1}^{n-1} \frac{1}{k!}, $$

which coincides with $T(n)$.
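
The recurrence and the closed form can be compared directly. The function names below are hypothetical; integer arithmetic is used so that the comparison is exact.

```python
from math import factorial


def mults_recurrence(n):
    """T(n) = n * T(n - 1) + n with T(1) = 0."""
    return 0 if n == 1 else n * mults_recurrence(n - 1) + n


def mults_closed_form(n):
    """T(n) = n! * (1/1! + 1/2! + ... + 1/(n-1)!), written with exact integers."""
    return sum(factorial(n) // factorial(k) for k in range(1, n))


for n in range(1, 9):
    print(n, mults_recurrence(n), mults_closed_form(n), factorial(n))
```

For $n = 3$ both functions give 9, which agrees with the value of `count` produced by the recursive program of Problem 2.23 for a $3 \times 3$ matrix.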


2.26 Solution.
Let us find the total number of inversions $S(N)$ contained in all permutations of
$N$ elements. The ratio $\dfrac{S(N)}{N!}$ is equal to the mean value of the number of
inversions $A(N)$ in the $N$-element array.

In order to compute the variable $S(N)$, suppose that some permutation
$(a_{i_1}, a_{i_2}, \dots, a_{i_{N-1}}, a_{i_N})$ contains exactly $\sigma$ inversions.
Note that in the reversed permutation $(a_{i_N}, a_{i_{N-1}}, \dots, a_{i_2}, a_{i_1})$ the number of inversions is
$\dfrac{N(N - 1)}{2} - \sigma$. This means that the total number of inversions in the pair of arrays

$$ (a_{i_1}, a_{i_2}, \dots, a_{i_{N-1}}, a_{i_N}) \quad \text{and} \quad (a_{i_N}, a_{i_{N-1}}, \dots, a_{i_2}, a_{i_1}) $$

is equal to $\sigma + \left( \dfrac{N(N - 1)}{2} - \sigma \right) = \dfrac{N(N - 1)}{2}$.

Since there exist $N!$ permutations of the $N$-element array, which split into $N!/2$ such pairs, then
$S(N) = \dfrac{1}{2} N! \times \dfrac{N(N - 1)}{2}$, and therefore there exists the following estimate of the mean
value of the number of inversions:

$$ A(N) = \frac{S(N)}{N!} = \frac{1}{4} N^2 + O(N) \quad \text{for } N \to \infty. $$
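
For small $N$ the mean can be computed by enumerating all permutations, which gives exactly $N(N - 1)/4$. A brief sketch (the function name is hypothetical):

```python
from fractions import Fraction
from itertools import permutations


def mean_inversions(n):
    """Average number of inversions over all n! permutations of n elements."""
    total = count = 0
    for p in permutations(range(n)):
        total += sum(p[i] > p[j] for i in range(n) for j in range(i + 1, n))
        count += 1
    return Fraction(total, count)


for n in range(1, 7):
    print(n, mean_inversions(n), Fraction(n * (n - 1), 4))
```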
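2.27 Proof.

Denote the elements of the matrix $B$ by $b_{ij}$, $1 \le i, j \le n$. Using the introduced
designation, we can write that the matrix $I + \varepsilon B$ is formed by the elements
$(\delta_{ij} + \varepsilon b_{ij})$.

According to the definition, the variable $\det(I + \varepsilon B)$ is equal to the sum over all
possible permutations:

$$ \det(I + \varepsilon B) = \sum_{\text{perm}} (-1)^{\sigma} (\delta_{1 i_1} + \varepsilon b_{1 i_1})(\delta_{2 i_2} + \varepsilon b_{2 i_2}) \dots (\delta_{n i_n} + \varepsilon b_{n i_n}). $$

Removing the brackets under the summation sign, we obtain

$$ \det(I + \varepsilon B) = \sum_{\text{perm}} (-1)^{\sigma} \Big( \delta_{1 i_1} \delta_{2 i_2} \dots \delta_{n i_n} + \varepsilon b_{1 i_1} \delta_{2 i_2} \delta_{3 i_3} \dots \delta_{n i_n} + \varepsilon b_{2 i_2} \delta_{1 i_1} \delta_{3 i_3} \dots \delta_{n i_n} $$
$$ + \varepsilon b_{3 i_3} \delta_{1 i_1} \delta_{2 i_2} \dots \delta_{n i_n} + \dots + \varepsilon b_{n i_n} \delta_{1 i_1} \delta_{2 i_2} \dots \delta_{(n-1) i_{n-1}} + O(\varepsilon^2) \Big). $$

The product $\delta_{1 i_1} \delta_{2 i_2} \dots \delta_{n i_n}$ is equal to one if $i_1 = 1, i_2 = 2, \dots, i_n = n$, and is
equal to zero in other cases.
Further, the products of the form

$$ \varepsilon b_{k i_k} \delta_{1 i_1} \delta_{2 i_2} \dots \delta_{(k-1) i_{k-1}} \delta_{(k+1) i_{k+1}} \dots \delta_{n i_n} $$

reduce to the summands $\varepsilon b_{kk}$.
This implies that

$$ \det(I + \varepsilon B) = (-1)^{\sigma_0} \left( 1 + \varepsilon b_{11} + \varepsilon b_{22} + \dots + \varepsilon b_{nn} + O(\varepsilon^2) \right), $$

where $\sigma_0$ is the number of inversions in the permutation $(i_1, i_2, \dots, i_n) = (1, 2, \dots, n)$.

It is clear that $\sigma_0 = 0$.

As a result we obtain

$$ \det(I + \varepsilon B) = (-1)^0 \left( 1 + \varepsilon \sum_{k=1}^{n} b_{kk} + O(\varepsilon^2) \right) = 1 + \varepsilon \operatorname{tr} B + O(\varepsilon^2). $$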
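
The identity can be illustrated numerically for a $3 \times 3$ matrix: the quantity $\det(I + \varepsilon B) - 1 - \varepsilon \operatorname{tr} B$, divided by $\varepsilon^2$, should stay bounded as $\varepsilon$ decreases. The sketch below uses exact fractions; the matrix `B` and the function names are arbitrary choices made for the illustration.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3 x 3 matrix by expansion in the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def remainder(b, eps):
    """det(I + eps * B) - 1 - eps * tr(B) for a 3 x 3 matrix B."""
    m = [[(1 if i == j else 0) + eps * b[i][j] for j in range(3)] for i in range(3)]
    trace = sum(b[i][i] for i in range(3))
    return det3(m) - 1 - eps * trace


B = [[2, -1, 0], [3, 1, 4], [0, 5, -2]]
for k in (1, 2, 3, 4):
    eps = Fraction(1, 10 ** k)
    # The ratio tends to a constant, so the remainder is O(eps^2).
    print(eps, remainder(B, eps) / eps ** 2)
```

For a $3 \times 3$ matrix the printed ratio equals the sum of the principal second order minors of $B$ plus $\varepsilon \det B$, which makes the boundedness explicit.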


2.28 Answer: $\det\left( \dfrac{I}{(I + \varepsilon B)^p} \right) = 1 - p\,\varepsilon \operatorname{tr} B + O(\varepsilon^2)$.
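2.29 Proof.
Represent the matrix $M$ in the form $M = G(I + \varepsilon G^{-1}H)$. Calculate the
determinant

$$ \det M = \det G \det(I + \varepsilon G^{-1}H) $$
$$ = \det G \exp\left( \operatorname{tr} \ln(I + \varepsilon G^{-1}H) \right) = \det G \exp\left( \operatorname{tr} \sum_{k=1}^{\infty} \frac{(-1)^{k-1}}{k} (\varepsilon G^{-1}H)^k \right) $$
$$ = \det G \left( 1 + \varepsilon \operatorname{tr}(G^{-1}H) + \tfrac{1}{2} \varepsilon^2 \left( \operatorname{tr}^2(G^{-1}H) - \operatorname{tr}(G^{-1}H)^2 \right) + O(\varepsilon^3) \right). $$

Thus, the formula (2.80) is proved.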


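2.30 Solution.
The determinant of the matrix is equal to $\det A = ad - bc$.
Calculate the cofactors for each of the elements of the matrix $A$:

$$ A_{11} = (-1)^{1+1} \cdot d = d, \qquad A_{12} = (-1)^{1+2} \cdot c = -c; $$
$$ A_{21} = (-1)^{2+1} \cdot b = -b, \qquad A_{22} = (-1)^{2+2} \cdot a = a. $$

Write the matrix of cofactors:

$$ \begin{pmatrix} d & -c \\ -b & a \end{pmatrix}. $$

Then the sought inverse matrix has the form:

$$ A^{-1} = \frac{1}{ad - bc} \begin{pmatrix} d & -b \\ -c & a \end{pmatrix}. $$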
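
The resulting formula translates directly into code. In this sketch the function name `inverse_2x2` is hypothetical and exact fractions are assumed, so the result can be compared with hand calculations.

```python
from fractions import Fraction


def inverse_2x2(a, b, c, d):
    """Inverse of [[a, b], [c, d]] as (1 / (ad - bc)) * [[d, -b], [-c, a]]."""
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("the matrix is degenerate: ad - bc = 0")
    return [[d / det, -b / det], [-c / det, a / det]]


print(inverse_2x2(1, 2, 3, 4))
print(inverse_2x2(3, 4, 5, 7))
```

The two calls use the matrices of Problem 2.31 (1) and (2).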


2.31 Solution. 


(1) Since det A = —2 + 0, then the inverse matrix exists. Find the cofactors of the 
elements of the matrix A: 


All =4. A12 = —3, 


Az, = —2, An = 1. 


4 3 
angi 


Therefore, the matrix of cofactors can be written in the form: 
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T 
413 4 -2 


Transpose the matrix (Aj;;): 


=2-1 —3 1 
For computing A~!, divide the obtained adjoint matrix by the determinant: 
“4 4 -2 —2 
ATS /(-2) = 
—3 1 3/2 —1/2 


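(2) Since $\det A = 1 \neq 0$, the inverse matrix exists. Find the cofactors of the elements of the matrix $A$:

$$A_{11} = 7, \quad A_{12} = -5,$$
$$A_{21} = -4, \quad A_{22} = 3.$$

Transpose the matrix $(A_{ij})$:

$$\begin{pmatrix} 7 & -5 \\ -4 & 3 \end{pmatrix}^{T} = \begin{pmatrix} 7 & -4 \\ -5 & 3 \end{pmatrix}.$$

Therefore,

$$A^{-1} = \frac{1}{\det A}\begin{pmatrix} 7 & -4 \\ -5 & 3 \end{pmatrix} = \begin{pmatrix} 7 & -4 \\ -5 & 3 \end{pmatrix}.$$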


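(3) Compute the determinant by the method of expansion in the first column:

$$\det A = 3\begin{vmatrix} -3 & 1 \\ -5 & -1 \end{vmatrix} - 2\begin{vmatrix} -4 & 5 \\ -5 & -1 \end{vmatrix} + 3\begin{vmatrix} -4 & 5 \\ -3 & 1 \end{vmatrix} = 24 - 58 + 33 = -1 \neq 0.$$

Find the cofactors:

$$A_{11} = \begin{vmatrix} -3 & 1 \\ -5 & -1 \end{vmatrix} = 8, \quad A_{12} = 5, \quad A_{13} = -1,$$

$$A_{21} = -29, \quad A_{22} = -18, \quad A_{23} = 3,$$

$$A_{31} = 11, \quad A_{32} = 7, \quad A_{33} = -1.$$

We will obtain the matrix of cofactors:

$$\begin{pmatrix} 8 & 5 & -1 \\ -29 & -18 & 3 \\ 11 & 7 & -1 \end{pmatrix}.$$

Perform the transposition operation:

$$\begin{pmatrix} 8 & 5 & -1 \\ -29 & -18 & 3 \\ 11 & 7 & -1 \end{pmatrix}^{T} = \begin{pmatrix} 8 & -29 & 11 \\ 5 & -18 & 7 \\ -1 & 3 & -1 \end{pmatrix}.$$

Dividing by $\det A$, write the inverse matrix:

$$A^{-1} = \frac{1}{\det A}\begin{pmatrix} 8 & -29 & 11 \\ 5 & -18 & 7 \\ -1 & 3 & -1 \end{pmatrix} = \begin{pmatrix} -8 & 29 & -11 \\ -5 & 18 & -7 \\ 1 & -3 & 1 \end{pmatrix}.$$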


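(4) Since $\det A = -3 \neq 0$, the matrix $A^{-1}$ is determined. Find the cofactors $A_{ij}$:

$$A_{11} = 7, \quad A_{12} = -5, \quad A_{13} = 6,$$
$$A_{21} = -6, \quad A_{22} = 3, \quad A_{23} = -3,$$
$$A_{31} = 1, \quad A_{32} = 1, \quad A_{33} = -3.$$

The inverse matrix is equal to

$$A^{-1} = -\frac{1}{3}\begin{pmatrix} 7 & -6 & 1 \\ -5 & 3 & 1 \\ 6 & -3 & -3 \end{pmatrix} = \begin{pmatrix} -7/3 & 2 & -1/3 \\ 5/3 & -1 & -1/3 \\ -2 & 1 & 1 \end{pmatrix}.$$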


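(5) Find the determinant: $\det A = -27 \neq 0$. The cofactors are equal to

$$A_{11} = -3, \quad A_{12} = -6, \quad A_{13} = -6,$$
$$A_{21} = -6, \quad A_{22} = -3, \quad A_{23} = 6,$$
$$A_{31} = -6, \quad A_{32} = 6, \quad A_{33} = -3.$$

Compute the elements of the inverse matrix:

$$A^{-1} = -\frac{1}{27}\begin{pmatrix} -3 & -6 & -6 \\ -6 & -3 & 6 \\ -6 & 6 & -3 \end{pmatrix} = \frac{1}{9}\begin{pmatrix} 1 & 2 & 2 \\ 2 & 1 & -2 \\ 2 & -2 & 1 \end{pmatrix}.$$

2.32 Answer: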
1 LG 0 
1 “a 
1-- 0 

1 2 1 1 

a 1 1 ae i 
(1) ; 2) 10 = -=]; ©) 

0 1 2 3 00 1 1 
2 3 4 
0 0 3 1 
0 oO - 
4 


2.33 Answer: [A, A~'] =[A, AW] = O. 
2.34 Solution. 
(1) As is known, the matrix $A$ does not have an inverse when the condition $\det A = 0$ is fulfilled.

Calculate the determinant of the given matrix: $\det A = -\lambda^3 - 3\lambda^2 = -\lambda^2(\lambda + 3)$.
Therefore, the matrix does not have an inverse for $\lambda \in \{0, -3\}$.

(2) The determinant is equal to $-\lambda^3 + 3\lambda^2 + \lambda - 3 = -(\lambda - 3)(\lambda - 1)(\lambda + 1)$. The matrix does not have an inverse for $\lambda \in \{-1, 1, 3\}$.


2.35 Solution. 
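For finding the inverse matrix, compute the determinant $\det A$:

$$\det A = \begin{vmatrix} 1 & \alpha & 0 & 0 \\ 0 & 1 & \beta & 0 \\ 0 & 0 & 1 & \gamma \\ 0 & 0 & 0 & 1 \end{vmatrix}.$$

$\det A = 1$, since $A$ is an upper triangular matrix and its determinant is equal to the product of the diagonal elements.
Find the cofactors:

$$A_{11} = 1, \quad A_{12} = 0, \quad A_{13} = 0, \quad A_{14} = 0,$$
$$A_{21} = -\alpha, \quad A_{22} = 1, \quad A_{23} = 0, \quad A_{24} = 0,$$
$$A_{31} = \alpha\beta, \quad A_{32} = -\beta, \quad A_{33} = 1, \quad A_{34} = 0,$$
$$A_{41} = -\alpha\beta\gamma, \quad A_{42} = \beta\gamma, \quad A_{43} = -\gamma, \quad A_{44} = 1.$$

We perform transposition and obtain the adjoint matrix:

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ -\alpha & 1 & 0 & 0 \\ \alpha\beta & -\beta & 1 & 0 \\ -\alpha\beta\gamma & \beta\gamma & -\gamma & 1 \end{pmatrix}^{T} = \begin{pmatrix} 1 & -\alpha & \alpha\beta & -\alpha\beta\gamma \\ 0 & 1 & -\beta & \beta\gamma \\ 0 & 0 & 1 & -\gamma \\ 0 & 0 & 0 & 1 \end{pmatrix}.$$

Since the determinant is equal to one, $A^{-1}$ coincides with the adjoint matrix:

$$A^{-1} = \begin{pmatrix} 1 & -\alpha & \alpha\beta & -\alpha\beta\gamma \\ 0 & 1 & -\beta & \beta\gamma \\ 0 & 0 & 1 & -\gamma \\ 0 & 0 & 0 & 1 \end{pmatrix}.$$

As is shown above, the determinant $\det A$ does not depend on the parameters $\alpha, \beta, \gamma$. Therefore, the matrix $A^{-1}$ is defined for any values of $\alpha, \beta, \gamma \in \mathbb{R}$.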


2.36 Solution. 

The determinant of the matrix $G_n = (g_{ij})$ is equal to $\det G_n = (-1)^{n(n-1)/2}$, since the only non-zero product in the sum of the form (2.5) is $g_{1,n}\,g_{2,n-1}\cdots g_{n,1} = 1$, and the sign multiplier for this product takes the value $(-1)^{n(n-1)/2}$ (see Problem 2.5).

Construct the matrix of cofactors of the elements $g_{ij}$, where $1 \leqslant i, j \leqslant n$. The cofactor of any element equal to zero is equal to zero: deleting the row and the column of such an element leaves a minor that contains a zero row and a zero column, and such a minor is equal to zero. The minor of any element $g_{i,(n+1)-i}$, located on the secondary diagonal, is equal to $\det G_{n-1} = (-1)^{(n-1)(n-2)/2}$, since after deleting the row and the column of $g_{i,(n+1)-i}$ we obtain the matrix $G_{n-1}$ of size $(n-1) \times (n-1)$.

Therefore, the cofactors of such elements $g_{i,(n+1)-i}$ are equal to

$$(-1)^{i + (n+1) - i}\det G_{n-1} = (-1)^{n+1}(-1)^{(n-1)(n-2)/2} = (-1)^{(n^2 - n + 4)/2} = (-1)^{n(n-1)/2}.$$

Thus, the matrix of cofactors is equal to the original $G_n$ multiplied by $(-1)^{n(n-1)/2}$.

As is easy to see, transposition does not change the obtained matrix. The last step is to divide the adjoint matrix by the determinant $\det G_n = (-1)^{n(n-1)/2}$. Finally we obtain that the matrix inverse of $G$ coincides with $G$ itself: $G^{-1} = G$ for all values of $n \geqslant 2$.
2.37 Solution.
Let us provide Python code for solving the problem.


import numpy as np


def get_Hilbert_matrix(n):
    # Element (i, j) of the Hilbert matrix is 1 / (i + j - 1), with i, j = 1..n
    return np.array([[1 / (i + j - 1)
                      for j in range(1, n + 1)] for i in range(1, n + 1)])


matrix = get_Hilbert_matrix(6)
inversed = np.linalg.inv(matrix)
hh1 = np.matmul(matrix, inversed)  # H * H^(-1)
h1h = np.matmul(inversed, matrix)  # H^(-1) * H


print(matrix)
print(inversed)
print(hh1)
print(h1h)


The difference of the elements of the matrices H - H~! and H~! . H computed 
with the help of Python from the identity matrix is ~ 107!° forn = 6, ~ 1077 for 
n = 8 and ~ 107° forn = 7. (Here, the symbol ~ means equality by the order of 
value.) 

Thus, Hilbert matrices demonstrate accumulation of machine errors when mak- 
ing computations with real numbers [58]. These matrices are very often used for 
testing of numerical algorithms. 

The matrix H~! can be found in an explicit form; the analytical representations 
for hj; are shown in [40]. An interesting peculiarity of this problem is also the fact 
that the elements of the inverse matrix are integer numbers. 
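Both remarks, the loss of accuracy in floating-point arithmetic and the integer entries of the exact inverse, can be explored with a short experiment. The sketch below deliberately avoids NumPy and uses a hand-written Gauss-Jordan routine so that the same code runs on floats and on exact fractions; the helper names are assumptions of this illustration, and the printed deviations will vary slightly between machines.

```python
from fractions import Fraction


def hilbert(n, one=Fraction(1)):
    """n x n Hilbert matrix; `one` selects the number type (Fraction or float)."""
    return [[one / (i + j - 1) for j in range(1, n + 1)] for i in range(1, n + 1)]


def invert(m):
    """Gauss-Jordan elimination with partial pivoting on the block [m | I]."""
    n = len(m)
    num = type(m[0][0])
    a = [list(row) + [num(int(i == j)) for j in range(n)]
         for i, row in enumerate(m)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        a[c], a[p] = a[p], a[c]
        pivot = a[c][c]
        a[c] = [x / pivot for x in a[c]]
        for r in range(n):
            if r != c:
                f = a[r][c]
                a[r] = [x - f * y for x, y in zip(a[r], a[c])]
    return [row[n:] for row in a]


def max_deviation_from_identity(n):
    """Largest |(H * H^(-1) - I)_ij| when everything is computed in floats."""
    h = hilbert(n, one=1.0)
    h_inv = invert(h)
    prod = [[sum(h[i][k] * h_inv[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]
    return max(abs(prod[i][j] - (i == j)) for i in range(n) for j in range(n))


exact_inverse = invert(hilbert(6))  # exact rational arithmetic
print(all(x.denominator == 1 for row in exact_inverse for x in row))

for n in (6, 8, 10):
    print(n, max_deviation_from_identity(n))
```

With `Fraction` entries the inverse is exact, so its integer entries can be inspected directly; with floats the deviation from the identity matrix is expected to grow quickly with $n$.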


2.38 Solution. 

This matrix is an upper triangular one, and its determinant is equal to the product 
of the elements positioned on the main diagonal, i.e. is equal to one. 

Construct the matrix of cofactors. The cofactors of all unit elements positioned on the main diagonal are equal to one. Deleting the row and the column of any zero, except for the zeros standing on the diagonal directly below the main one, leaves a matrix with proportional rows; therefore, the corresponding minors are equal to zero. The zeros positioned on the diagonal directly below the main one have cofactors equal to $(-\lambda)$: the sum $i + j$ for such elements is always odd, and deleting their row and column leaves an upper triangular matrix with one element $\lambda$ and ones elsewhere on the main diagonal. Deleting the row and the column of $\lambda$ in any power except zero leaves an upper triangular matrix with zeros and ones on the main diagonal. Therefore, both the minor and the cofactor are in this case equal to zero.

Finally, having executed the transposition operation, we obtain the inverse matrix: it has ones on the main diagonal, the values $(-\lambda)$ on the diagonal directly above the main one, and zeros elsewhere.


2.39 Solution. 
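Let us use the mathematical induction method.
Basis step
For the least natural $n = 1$ the equality

$$(A_1)^{-1} = A_1^{-1}$$

is true.

Inductive step
Assume that for $n = k$ the equality

$$(A_1 A_2 \cdots A_{k-1} A_k)^{-1} = A_k^{-1} A_{k-1}^{-1} \cdots A_2^{-1} A_1^{-1}$$

is valid. Then we should prove that for $n = k + 1$ the following is true:

$$(A_1 A_2 \cdots A_k A_{k+1})^{-1} = A_{k+1}^{-1} A_k^{-1} \cdots A_2^{-1} A_1^{-1}.$$

Denote the product $A_1 A_2 \cdots A_k$ by $B$. Then, using the rule for the inverse of a product of two matrices and the inductive assumption,

$$(B A_{k+1})^{-1} = A_{k+1}^{-1} B^{-1} = A_{k+1}^{-1}\bigl(A_k^{-1} A_{k-1}^{-1} \cdots A_2^{-1} A_1^{-1}\bigr).$$

Therefore, according to the mathematical induction method, $\forall n \in \mathbb{N}$ the identity

$$(A_1 A_2 \cdots A_{n-1} A_n)^{-1} = A_n^{-1} A_{n-1}^{-1} \cdots A_2^{-1} A_1^{-1}$$

is valid.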
2.40 Solution. 
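(1) Find the matrix inverse of the matrix $A = \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}$.
Its determinant is equal to $\det A = 4$, the matrix of cofactors has the components $\begin{pmatrix} 2 & 0 \\ -1 & 2 \end{pmatrix}$, and the inverse of the matrix is $A^{-1} = \begin{pmatrix} 1/2 & -1/4 \\ 0 & 1/2 \end{pmatrix}$.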
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—64 
We obtain the matrix X by multiplying A~! by the matrix B = : 
21 
ae 1/2 -1/4 de Neue (OU 
0 1/2 2-1 boy |. ae 
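(2) Let us find the matrix inverse of the matrix $A = \begin{pmatrix} -1 & 1 & 1 \\ 0 & 2 & 2 \\ 0 & 2 & 3 \end{pmatrix}$.
The determinant is equal to $\det A = -2$.
The inverse matrix:

$$A^{-1} = \begin{pmatrix} -1 & 1/2 & 0 \\ 0 & 3/2 & -1 \\ 0 & -1 & 1 \end{pmatrix}.$$

Multiply the matrix inverse of the matrix $A$ by the matrix $B = \begin{pmatrix} -2 & 1 & 1 \\ -1 & 0 & 2 \\ -1 & -2 & 0 \end{pmatrix}$:

$$X = A^{-1}B = \begin{pmatrix} -1 & 1/2 & 0 \\ 0 & 3/2 & -1 \\ 0 & -1 & 1 \end{pmatrix}\begin{pmatrix} -2 & 1 & 1 \\ -1 & 0 & 2 \\ -1 & -2 & 0 \end{pmatrix} = \frac{1}{2}\begin{pmatrix} 3 & -2 & 0 \\ -1 & 4 & 6 \\ 0 & -4 & -4 \end{pmatrix}.$$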


(3) We obtain the solution of the equation by multiplying both sides of the equation $X \cdot A = B$ by $A^{-1}$ on the right.
To do this, find the matrix inverse of the matrix $A$:

$$A^{-1} = \frac{1}{10}\begin{pmatrix} -30 & 12 & 38 \\ -5 & 3 & 7 \\ -30 & 14 & 36 \end{pmatrix}.$$

Compute the elements of the unknown matrix $X = B \cdot A^{-1}$:

$$X = \begin{pmatrix} 0 & -1 & 2 \\ -1 & -2 & -2 \\ 0 & 1 & 10 \end{pmatrix}\cdot\frac{1}{10}\begin{pmatrix} -30 & 12 & 38 \\ -5 & 3 & 7 \\ -30 & 14 & 36 \end{pmatrix} = \frac{1}{10}\begin{pmatrix} -55 & 25 & 65 \\ 100 & -46 & -124 \\ -305 & 143 & 367 \end{pmatrix}.$$


(4) From the equation $A \cdot X \cdot B = C$, find the unknown matrix $X$ by the formula $X = A^{-1} \cdot C \cdot B^{-1}$.
We have 


pee oe ea 


~ 3/3 2] aL Se 
Consecutively perform the multiplications in the following order: $(A^{-1}C) \cdot B^{-1}$.


Multiply the matrix inverse of the matrix A by the matrix C: 


1 }2-3]]1-1 1 |2-5 


Alc=— = 
133-2 | oa 13 | 34 


Finally, multiply the product A~!C by the matrix, inverse of the matrix B: 


1 )2-5 1 |-4 7 1 | —33 19 


“13 | gat |- 31) 5 <7 | 403 | 1799 


2.41 Answer: $\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 256a & 1 \end{pmatrix}$.

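2.42 Solution.
Having calculated the first several powers $A^n$, namely the second, third, fourth and fifth powers of the matrix $A = \begin{pmatrix} 1 & 0 & 0 \\ g & 1 & 1 \\ h & 0 & 1 \end{pmatrix}$, we have

$$A^2 = \begin{pmatrix} 1 & 0 & 0 \\ 2g + h & 1 & 2 \\ 2h & 0 & 1 \end{pmatrix}, \quad A^3 = \begin{pmatrix} 1 & 0 & 0 \\ 3g + 3h & 1 & 3 \\ 3h & 0 & 1 \end{pmatrix},$$

$$A^4 = \begin{pmatrix} 1 & 0 & 0 \\ 4g + 6h & 1 & 4 \\ 4h & 0 & 1 \end{pmatrix}, \quad A^5 = \begin{pmatrix} 1 & 0 & 0 \\ 5g + 10h & 1 & 5 \\ 5h & 0 & 1 \end{pmatrix}.$$

Based on the obtained equalities, suppose that for all natural values of $n$ the identity is fulfilled:

$$A^n = \begin{pmatrix} 1 & 0 & 0 \\ ng + n(n-1)h/2 & 1 & n \\ nh & 0 & 1 \end{pmatrix}.$$

Denote the respective predicate by $P(n)$.
Let us use the mathematical induction method.
Basis step

$$A^1 = \begin{pmatrix} 1 & 0 & 0 \\ 1 \cdot g + 1 \cdot (1-1)h/2 & 1 & 1 \\ 1 \cdot h & 0 & 1 \end{pmatrix} = A \ \text{is true.}$$

Inductive step
Assume that for $n = k$ the predicate $P(n)$ takes the true value. Then:

$$A^k = \begin{pmatrix} 1 & 0 & 0 \\ kg + k(k-1)h/2 & 1 & k \\ kh & 0 & 1 \end{pmatrix}.$$

Prove that for $n = k + 1$ the equality is valid:

$$A^{k+1} = \begin{pmatrix} 1 & 0 & 0 \\ (k+1)g + k(k+1)h/2 & 1 & k+1 \\ (k+1)h & 0 & 1 \end{pmatrix}.$$

Indeed,

$$A^{k+1} = A^k \cdot A = \begin{pmatrix} 1 & 0 & 0 \\ kg + k(k-1)h/2 & 1 & k \\ kh & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 \\ g & 1 & 1 \\ h & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ (k+1)g + k(k+1)h/2 & 1 & k+1 \\ (k+1)h & 0 & 1 \end{pmatrix}.$$

Thus, the predicate $P(n)$ is proved for all $n \in \mathbb{N}$.
Substituting as the exponent of the matrix the number $n = 512$, we obtain the answer:

$$A^{512} = \begin{pmatrix} 1 & 0 & 0 \\ 256(2g + 511h) & 1 & 512 \\ 512h & 0 & 1 \end{pmatrix}.$$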
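The closed form can be cross-checked by raising the matrix to the 512th power directly. The sketch below uses exponentiation by squaring on plain integer lists; the numerical values of `g` and `h` are arbitrary assumptions made only for the test, and the helper names are hypothetical.

```python
def mat_mul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_pow(a, n):
    """a**n for a non-negative integer n, by exponentiation by squaring."""
    size = len(a)
    result = [[int(i == j) for j in range(size)] for i in range(size)]
    base = a
    while n > 0:
        if n & 1:
            result = mat_mul(result, base)
        base = mat_mul(base, base)
        n >>= 1
    return result


# Arbitrary integer values of the parameters (assumption of this sketch).
g, h = 3, 5
A = [[1, 0, 0], [g, 1, 1], [h, 0, 1]]

direct = mat_pow(A, 512)
closed_form = [[1, 0, 0],
               [256 * (2 * g + 511 * h), 1, 512],
               [512 * h, 0, 1]]
print(direct == closed_form)
```

Exponentiation by squaring needs only about ten matrix multiplications for $n = 512 = 2^9$, which is why it is preferred over 511 successive multiplications.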
2.43 Solution. 
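Denote by $P(n)$ the predicate $F^n = \begin{pmatrix} F_{n-1} & F_n \\ F_n & F_{n+1} \end{pmatrix}$.

Basis step
The basis step is formed by the statement $P(2)$:

$$\begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ 1 & 2 \end{pmatrix} = \begin{pmatrix} F_1 & F_2 \\ F_2 & F_3 \end{pmatrix},$$

which corresponds to the formula (2.85).
Inductive step
Assume that for $n = k$ the statement is true:

$$F^k = \begin{pmatrix} F_{k-1} & F_k \\ F_k & F_{k+1} \end{pmatrix}.$$

Compute the matrix $F^n$ for $n = k + 1$:

$$F^{k+1} = \begin{pmatrix} F_{k-1} & F_k \\ F_k & F_{k+1} \end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} F_k & F_{k-1} + F_k \\ F_{k+1} & F_k + F_{k+1} \end{pmatrix} = \begin{pmatrix} F_k & F_{k+1} \\ F_{k+1} & F_{k+2} \end{pmatrix}.$$

According to the definition of the Fibonacci sequence, each element of this sequence is equal to the sum of the two previous ones, and for all $k \geqslant 1$ the identity $F_{k-1} + F_k = F_{k+1}$ is valid.

Thus, the predicate $P(n)$ is proved for all natural $n > 1$.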
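The proved identity also gives a practical way of computing Fibonacci numbers. The sketch below compares the matrix power with the defining recurrence; it assumes the indexing $F_0 = 0$, $F_1 = 1$, and the function names are hypothetical.

```python
def mat_mul_2x2(a, b):
    """Product of two 2 x 2 matrices given as lists of rows."""
    return [[a[0][0] * b[0][0] + a[0][1] * b[1][0],
             a[0][0] * b[0][1] + a[0][1] * b[1][1]],
            [a[1][0] * b[0][0] + a[1][1] * b[1][0],
             a[1][0] * b[0][1] + a[1][1] * b[1][1]]]


def fib_matrix_power(n):
    """F**n for F = [[0, 1], [1, 1]], by exponentiation by squaring."""
    result = [[1, 0], [0, 1]]
    base = [[0, 1], [1, 1]]
    while n > 0:
        if n & 1:
            result = mat_mul_2x2(result, base)
        base = mat_mul_2x2(base, base)
        n >>= 1
    return result


def fib(n):
    """n-th Fibonacci number from the recurrence, with F_0 = 0 and F_1 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


for n in range(2, 12):
    power = fib_matrix_power(n)
    print(n, power, power == [[fib(n - 1), fib(n)], [fib(n), fib(n + 1)]])
```

The matrix route needs a number of multiplications proportional to $\log_2 n$, whereas the recurrence needs $n$ additions.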


2.44 Solution. 
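Let us try to find a regularity in the sequence $A^1, A^2, A^3, \dots$. To do this, raise the matrix to the second, third and fourth powers:

$$A^2 = \begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 2\alpha & 2\gamma + \alpha\beta \\ 0 & 1 & 2\beta \\ 0 & 0 & 1 \end{pmatrix},$$

$$A^3 = A^2 \cdot A = \begin{pmatrix} 1 & 2\alpha & 2\gamma + \alpha\beta \\ 0 & 1 & 2\beta \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 3\alpha & 3\gamma + 3\alpha\beta \\ 0 & 1 & 3\beta \\ 0 & 0 & 1 \end{pmatrix},$$

$$A^4 = A^3 \cdot A = \begin{pmatrix} 1 & 3\alpha & 3\gamma + 3\alpha\beta \\ 0 & 1 & 3\beta \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 4\alpha & 4\gamma + 6\alpha\beta \\ 0 & 1 & 4\beta \\ 0 & 0 & 1 \end{pmatrix}.$$

Analysis of the sequence of the powers $A^1, A^2, A^3, \dots$ leads to the hypothesis that

$$A^n = \begin{pmatrix} 1 & n\alpha & n\gamma + n(n-1)\alpha\beta/2 \\ 0 & 1 & n\beta \\ 0 & 0 & 1 \end{pmatrix}.$$

Let us prove the truth of this supposition with the help of the mathematical induction method. Denote the statement above by $P(n)$.

Basis step
The truth of the statement $P(1)$ is obvious.

Inductive step
Assume that $P(n)$ is valid for $n = k$ for some $k \geqslant 1$:

$$A^k = \begin{pmatrix} 1 & k\alpha & k\gamma + k(k-1)\alpha\beta/2 \\ 0 & 1 & k\beta \\ 0 & 0 & 1 \end{pmatrix}.$$

Prove that $P(k) \Rightarrow P(k+1)$.

$$A^{k+1} = A^k \cdot A = \begin{pmatrix} 1 & k\alpha & k\gamma + k(k-1)\alpha\beta/2 \\ 0 & 1 & k\beta \\ 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} 1 & \alpha & \gamma \\ 0 & 1 & \beta \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 1 & (k+1)\alpha & (k+1)\gamma + k(k+1)\alpha\beta/2 \\ 0 & 1 & (k+1)\beta \\ 0 & 0 & 1 \end{pmatrix}.$$

Therefore, $P(n)$ takes the true value for all $n \geqslant 1$. Thus it is proved that

$$A^n = \begin{pmatrix} 1 & n\alpha & n\gamma + n(n-1)\alpha\beta/2 \\ 0 & 1 & n\beta \\ 0 & 0 & 1 \end{pmatrix}$$

for all $n \in \mathbb{N}$.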


2.45 Solution. 
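Assume that the $q$-th power of the matrix $U(\varphi)$ is determined by the formula

$$(U(\varphi))^q = \begin{pmatrix} \cos(q\varphi) & \sin(q\varphi) \\ -\sin(q\varphi) & \cos(q\varphi) \end{pmatrix}, \quad \text{where } q \in \mathbb{Z}.$$

Denote this statement by $P(q)$ and prove it first for $q \in \mathbb{N}$. Let us apply the mathematical induction method.

Basis step

For $q = 1$ we have

$$(U(\varphi))^1 = U(\varphi) = \begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix}.$$

Inductive step
Assume that $P(q)$ is true for $q = k$:

$$(U(\varphi))^k = \begin{pmatrix} \cos(k\varphi) & \sin(k\varphi) \\ -\sin(k\varphi) & \cos(k\varphi) \end{pmatrix}.$$

Prove the truth of the statement for $q = k + 1$:

$$(U(\varphi))^{k+1} = \begin{pmatrix} \cos(k\varphi) & \sin(k\varphi) \\ -\sin(k\varphi) & \cos(k\varphi) \end{pmatrix}\begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix}$$

$$= \begin{pmatrix} \cos(k\varphi)\cos\varphi - \sin(k\varphi)\sin\varphi & \cos(k\varphi)\sin\varphi + \sin(k\varphi)\cos\varphi \\ -\sin(k\varphi)\cos\varphi - \cos(k\varphi)\sin\varphi & -\sin(k\varphi)\sin\varphi + \cos(k\varphi)\cos\varphi \end{pmatrix}$$

$$= \begin{pmatrix} \cos((k+1)\varphi) & \sin((k+1)\varphi) \\ -\sin((k+1)\varphi) & \cos((k+1)\varphi) \end{pmatrix}.$$

Therefore, for $q \in \mathbb{N}$ the following equality holds:

$$(U(\varphi))^q = \begin{pmatrix} \cos(q\varphi) & \sin(q\varphi) \\ -\sin(q\varphi) & \cos(q\varphi) \end{pmatrix}.$$

Now it only remains for us to prove the truth of this equality for all integer $q$. Indeed, $(U(\varphi))^0 = I = U(0)$, and for all $\varphi \in \mathbb{R}$ there exists the inverse matrix

$$(U(\varphi))^{-1} = U(-\varphi).$$

Thus, for $q = 0, 1, 2, \dots$ the equality $(U(\varphi))^{-q} = U(-q\varphi)$ is valid.
This means that

$$(U(\varphi))^q = \begin{pmatrix} \cos(q\varphi) & \sin(q\varphi) \\ -\sin(q\varphi) & \cos(q\varphi) \end{pmatrix} \quad \text{for all } q \in \mathbb{Z}.$$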
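The result, including the case of negative exponents, can be checked numerically. In the sketch below the inverse is computed from the general $2 \times 2$ formula rather than from $U(-\varphi)$, so that the check does not presuppose the statement being verified; the angle `phi` and the helper names are arbitrary assumptions of this illustration.

```python
import math


def U(phi):
    """The matrix U(phi) = [[cos phi, sin phi], [-sin phi, cos phi]]."""
    return [[math.cos(phi), math.sin(phi)], [-math.sin(phi), math.cos(phi)]]


def mat_mul_2x2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def inverse_2x2(m):
    """Inverse of a 2 x 2 matrix via the adjoint matrix and the determinant."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def power(phi, q):
    """(U(phi))**q for any integer q by repeated multiplication."""
    base = U(phi) if q >= 0 else inverse_2x2(U(phi))
    result = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(abs(q)):
        result = mat_mul_2x2(result, base)
    return result


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


phi = 0.3  # arbitrary angle in radians
for q in range(-6, 7):
    print(q, close(power(phi, q), U(q * phi)))
```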
2.46 Answer:

$[A, B^q] = qB^{q-1}$ for all $q \in \mathbb{Z}$.


Answers and Solutions 


2.47 Solution. 
Consecutively perform the algebraic operations: 


(1) 


f(A) = = 


01 03 02 
(2) 
1 -2 1 -2 1 -2 10 7 
f(A) = —3 +2 = 
3 1 3 1 3 1 01 -6 7 
3 -6 20 62 
—9 3 02 36 
2.48 Solution. 
(1) 
100]/100]]100 
g(A)=]110}}110]/}110]+ 
111]//111]/]111 
100 100 300 
=/310/+/110]—-—]030 
631 111 003 
(2) 


—2 1 0//-2 1 Oo] |-2 10 
g(A)=] 3 -11 3-11 3 -11 
210 210 210 


100 
110 
111 


100 
—3)010 
001 


-10 0 
4 -1 0 
7 4-1 


’ 
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—2 10 100 
+] 3 -11}]-3]010 
2 10 001 
—21 11 -3 —2 10 300 —26 12 —3 
=] 27 -13 5 ]}]+] 3 -11]—|]030]=] 30 -17 6 
7 -1 1 2 10 003 9 0 -2 


2.49 Solution. 


(1) The numerator of the fraction is equal to 


1 0 O 
0-41 0 
0 0 —-6 


In turn, the denominator forms the matrix 


—-4 0 O 
0 —46 0 |. 
0 O -6 


and the matrix that is inverse of it is equal to 


ia 0 © 
0 -1/46 0 
0 oO -1/6 


Having performed the multiplication operation, we obtain 


io oO) |=14 G :@ 23 0 0 
g(A)=|0-41 0 0 -1/46 0 |=1/92] 0 820 
0 0 =6 0 O -1/6 0 092 


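(2) The numerator of the fraction is equal to

$$\begin{pmatrix} -6 & 2 & 0 \\ 0 & -6 & 0 \\ 0 & 2 & -6 \end{pmatrix}.$$

The denominator of this fraction is equal to

$$\begin{pmatrix} -6 & -2 & 0 \\ 0 & -6 & 0 \\ 0 & -2 & -6 \end{pmatrix}.$$

Let us find the matrix inverse of the denominator:

$$\begin{pmatrix} -1/6 & 1/18 & 0 \\ 0 & -1/6 & 0 \\ 0 & 1/18 & -1/6 \end{pmatrix}.$$

As a result we obtain

$$g(A) = \begin{pmatrix} -6 & 2 & 0 \\ 0 & -6 & 0 \\ 0 & 2 & -6 \end{pmatrix}\begin{pmatrix} -1/6 & 1/18 & 0 \\ 0 & -1/6 & 0 \\ 0 & 1/18 & -1/6 \end{pmatrix} = \frac{1}{3}\begin{pmatrix} 3 & -2 & 0 \\ 0 & 3 & 0 \\ 0 & -2 & 3 \end{pmatrix}.$$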
2.50 Proof. 
Let us consider the sequence of integer non-negative powers of the matrix $A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$:

$$A^0 = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad A^1 = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \quad A^2 = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, \quad A^3 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix},$$

and so on.

Therefore, the elements of the matrix $\exp(A)$ are defined by the sums

$$(e^A)_{11} = 1 + 0/1! - 1/2! + 0/3! + 1/4! + \cdots = \sum_{k=0}^{\infty}\frac{(-1)^k}{(2k)!} = \cos 1,$$

$$(e^A)_{12} = 0 - 1/1! - 0/2! + 1/3! + 0/4! + \cdots = \sum_{k=0}^{\infty}\frac{(-1)^{k+1}}{(2k+1)!} = -\sin 1,$$

$$(e^A)_{21} = 0 + 1/1! + 0/2! - 1/3! + 0/4! + \cdots = \sum_{k=0}^{\infty}\frac{(-1)^k}{(2k+1)!} = \sin 1,$$

$$(e^A)_{22} = 1 + 0/1! - 1/2! + 0/3! + 1/4! + \cdots = \sum_{k=0}^{\infty}\frac{(-1)^k}{(2k)!} = \cos 1.$$

Therefore, the following equality holds:

$$\exp(A) = \begin{pmatrix} \cos 1 & -\sin 1 \\ \sin 1 & \cos 1 \end{pmatrix}.$$
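The equality can be confirmed numerically by summing the defining series of the matrix exponential. The sketch below truncates the series after a fixed number of terms; the cut-off `terms=25` and the helper names are assumptions of this illustration, chosen so that the neglected tail is far below double precision.

```python
import math


def mat_mul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_exp_series(a, terms=25):
    """Partial sum of exp(a) = sum over k of a**k / k!."""
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [[float(i == j) for j in range(n)] for i in range(n)]
    for k in range(1, terms):
        # After this line `term` holds a**k / k!
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    return result


A = [[0.0, -1.0], [1.0, 0.0]]
approx = mat_exp_series(A)
expected = [[math.cos(1), -math.sin(1)], [math.sin(1), math.cos(1)]]

print(approx)
print(expected)
```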
2.51 Solution. 
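(1) Compute the lower powers of the matrix $A$:

$$A^0 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \quad A^1 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad A^2 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix}.$$

It is clear that $\forall n \geqslant 1\ (A^n = A)$. The elements of the matrix $\exp(A)$ are equal to

$$(e^A)_{11} = (e^A)_{22} = 1 + \frac{1}{1!} + \frac{1}{2!} + \cdots = e, \quad (e^A)_{33} = 1,$$

and the remaining elements take zero values.
Therefore,

$$\exp(A) = \begin{pmatrix} e & 0 & 0 \\ 0 & e & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$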
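(2) The lower powers of the matrix are equal to

$$A^0 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \quad A^1 = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix}, \quad A^2 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}.$$

As is easy to see, $A^3 = O$, and all the higher natural powers of this matrix are equal to zero.

Finally we obtain

$$\exp(A) = I + A + \frac{A^2}{2} = \begin{pmatrix} 1 & 1 & 1/2 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix}.$$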
2.52 Solution. 
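(1) By the mathematical induction method, it is easy to prove that $A^n = \begin{pmatrix} 1 & 0 & n \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$ for all integer non-negative $n$.

According to the formula (2.49), we have

$$e^A = \begin{pmatrix} \sum\limits_{k=0}^{\infty}\dfrac{1}{k!} & 0 & \sum\limits_{k=0}^{\infty}\dfrac{k}{k!} \\ 0 & \sum\limits_{k=0}^{\infty}\dfrac{1}{k!} & 0 \\ 0 & 0 & \sum\limits_{k=0}^{\infty}\dfrac{1}{k!} \end{pmatrix}.$$

The sums defining the diagonal elements converge to Euler’s number $e$. The sum

$$(e^A)_{13} = \sum_{k=0}^{\infty}\frac{k}{k!} = \sum_{k=1}^{\infty}\frac{1}{(k-1)!} = \sum_{k=0}^{\infty}\frac{1}{k!}$$

is also equal to $e$.

Thus, write the answer:

$$e^A = \begin{pmatrix} e & 0 & e \\ 0 & e & 0 \\ 0 & 0 & e \end{pmatrix}.$$

(2) Having computed the arbitrary natural power of the matrix $A$, we obtain

$$A^n = \begin{pmatrix} 1 & 0 & n \\ n & 1 & n(n+1)/2 \\ 0 & 0 & 1 \end{pmatrix}.$$

Calculation of the diagonal elements of the exponential and of the element $(e^A)_{13}$ is performed similarly to item (1) of this problem; the element $(e^A)_{21}$ is given by the same sum as $(e^A)_{13}$.

The element positioned at the intersection of the second row and the third column is defined by the sum $(e^A)_{23} = \sum\limits_{n=1}^{\infty}\dfrac{n(n+1)}{2\,n!}$. Transform this sum to the form

$$\sum_{n=1}^{\infty}\frac{n(n+1)}{2\,n!} = \sum_{n=1}^{\infty}\frac{n+1}{2(n-1)!} = \sum_{n=2}^{\infty}\frac{1}{2(n-2)!} + \sum_{n=1}^{\infty}\frac{2}{2(n-1)!} = \frac{e}{2} + e.$$

Therefore, $(e^A)_{23} = \dfrac{3}{2}e$.
As a result we obtain

$$e^A = \begin{pmatrix} e & 0 & e \\ e & e & \frac{3}{2}e \\ 0 & 0 & e \end{pmatrix}.$$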
2.53 Solution. 
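(1) According to the formula (2.53), we have

$$\ln A = (A - I) - \frac{(A - I)^2}{2} + \frac{(A - I)^3}{3} - \cdots,$$

or

$$\ln\begin{pmatrix} 1 & 0 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} - \frac{1}{2}\begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}^2 + \cdots = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}.$$

(2) After computing the lower powers of the matrix $(A - I)$ we can write the general formula for $(A - I)^n$, where $n \geqslant 1$:

$$(A - I)^n = \begin{pmatrix} 0 & 0 & \delta_{n1} \\ \delta_{n1} & 0 & \delta_{n1} + \delta_{n2} \\ 0 & 0 & 0 \end{pmatrix},$$

where $\delta_{ij}$ is the Kronecker symbol.
This implies that

$$\ln\begin{pmatrix} 1 & 0 & 1 \\ 1 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 1/2 \\ 0 & 0 & 0 \end{pmatrix}.$$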
2.54 Hint. 
Use the formula (2.49) and apply the mathematical induction method. 
2.56 Answer: 


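A square matrix of size $3 \times 3$, after reducing it to the echelon form with the help of the elementary transformations, can take one of the following forms:

$$\begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 1 \end{pmatrix}, \quad \begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & a & b \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix}, \quad \begin{pmatrix} 1 & a & b \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}.$$

Here, $a$, $b$ and $c$ denote arbitrary real numbers.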
2.57 Answer: 


(1) addition of the $j$-th row to the $i$-th row for $j > i$;
(2) addition of the $j$-th column to the $i$-th column for $j < i$.


2.58 Solution. 


(1) Perform the following elementary transformations: add to the second row the 
first one, subtract from the third row the doubled first one, then add to the third 
row the second row. 


12 -—4 3-2 
We will obtain the matrix in the echelon form: A > | 05 —101 2 |, its 
00 0 012 


rank is equal to rk A = 3. 

(2) Subtract from the second row half of the first one, subtract from the fourth row two first rows, swap the second and the third rows, subtract from the fourth row the second one, swap the third and the fourth rows and, finally, subtract from the fourth row half of the third row.

We obtain

$$A \to \begin{pmatrix} 2 & -4 & 3 & 1 & 0 \\ 0 & 1 & -1 & 3 & 1 \\ 0 & 0 & -1 & -9 & 4 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix},$$

therefore, $\operatorname{rk} A = 3$.
(3) Let us use the elementary transformation method. Subtract from the second row
the quadruplicated first row, subtract from the third row the first one, subtract 
from the fourth row the doubled first row. Then, subtract from the third row the 
second one, multiplied by 5/9. Finally, subtract from the fourth row the second 
one, divided by 3. 

Then we obtain

$$A \to \begin{pmatrix} 1 & 2 & 1 & 3 \\ 0 & -9 & -9 & -18 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix},$$

therefore, $\operatorname{rk} A = 2$.

(4) Swap the first and the second rows, add to the third row the triplicated first row, add to the fifth row the doubled first row. Then, add to the third row the second one, multiplied by 11/2. Subtract from the fourth row the second one, multiplied by 5/2. Add to the fifth row the second one, multiplied by 5/2. Finally, subtract from the fifth row the third one, multiplied by 2/3.

After the said transformations we obtain

$$A \to \begin{pmatrix} -1 & -4 & -5 \\ 0 & 2 & -4 \\ 0 & 0 & -30 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$

therefore, the rank of this matrix is equal to three.

(5) Perform the following elementary transformations: subtract from the second row the first row, multiplied by 1/3. Then, subtract from the third row the first one, multiplied by 1/3. And finally, subtract from the third row the quadruplicated second row.

After that we obtain

$$A \to \begin{pmatrix} 3 & 5 & 7 \\ 0 & 1/3 & 2/3 \\ 0 & 0 & 0 \end{pmatrix};$$

the rank of such a matrix is equal to two.
(6) The matrix is presented in the echelon form, and, as is easy to see, rk A = 3. 
(7) Add to the second row the doubled first row, subtract from the third row the 
doubled first row, add to the third row the second row. 
Then we obtain


1-124 3 
A> |0-191012], 
0099 8 


and the rank of the matrix A is equal to three. 
(8) Subtract from the second row the doubled first row, subtract from the third row the first one, multiplied by five. Then, subtract from the fourth row the first one, multiplied by seven. Subtract from the third row the doubled second one and swap the third and the fourth rows. Finally, subtract from the third row the doubled second row.

After the said transformations we obtain

$$A \to \begin{pmatrix} 1 & 3 & 5 & -1 \\ 0 & -7 & 13 & 6 \\ 0 & 0 & 0 & -4 \\ 0 & 0 & 0 & 0 \end{pmatrix},$$

therefore, $\operatorname{rk} A = 3$.
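The elementary transformation method used throughout this solution can be automated. The following is a minimal sketch in exact rational arithmetic; the function name `rank` and the sample matrix are assumptions of this illustration (the sample was chosen so that the steps described in item (3) lead to the echelon form shown there, and it is not claimed to be the matrix from the problem statement).

```python
from fractions import Fraction


def rank(matrix):
    """Rank via reduction to echelon form using row swaps and row subtraction."""
    a = [[Fraction(x) for x in row] for row in matrix]
    rows = len(a)
    cols = len(a[0]) if a else 0
    r = 0  # number of pivot rows found so far
    for c in range(cols):
        # Look for a non-zero element in column c at or below row r.
        pivot = next((i for i in range(r, rows) if a[i][c] != 0), None)
        if pivot is None:
            continue
        a[r], a[pivot] = a[pivot], a[r]
        for i in range(r + 1, rows):
            f = a[i][c] / a[r][c]
            a[i] = [x - f * y for x, y in zip(a[i], a[r])]
        r += 1
        if r == rows:
            break
    return r


# Hypothetical sample matrix (see the remark above).
M = [[1, 2, 1, 3],
     [4, -1, -5, -6],
     [1, -3, -4, -7],
     [2, 1, -1, 0]]
print(rank(M))
```

Exact fractions are used instead of floats because a rank computation depends on deciding whether an element is exactly zero, which rounding errors would make unreliable.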


2.59 Solution. 

Consider the minor of the first order M i = B —y.If 6 S$ y, then the rank of the 
matrix A is no less than one. 

a ; i2 _ |B-y 0 

en, consider the minor of the second order M 2 = . If the 
yay 

condition y 0 is fulfilled, then rk A > 2. 

Finally, compute the bordering minors of the third order: 


p=y 0 
My33 =|y-a-y 0|=0, 
a—p Bp -a 
p=y 0 =, 
M23 =|y—a-y a|=0. 
a—-Bp Bp 0 


Therefore, the maximum value that the rank of the matrix A can take is equal to 
two. 
Note that with the help of the Kronecker symbol (see page 4) the formula for the rank of this matrix can be written in the form $\operatorname{rk} A = 2(1 - \delta_{\alpha 0}\delta_{\beta 0}\delta_{\gamma 0})$.


2.60 Solution. 


(a) Use the bordering minor method (see page 64): 


Mi=140, Mi3= 2{=% 
11 
Then, consider two cases. 
110 110 
(2)—() 
(1) If A = 1, then the matrix is equal to |} 110] ~ @?@) | 001 J. Itis clear 
001 000 


that its rank is equal to two. 
(2) If A # 1, then we compute the bordering minor of the third order (it is the 
only one): 


Therefore, the rank of the matrix is equal to

$$\operatorname{rk} A = \begin{cases} 2, & \text{if } \lambda = 1, \\ 3, & \text{if } \lambda \neq 1. \end{cases}$$
(b) The lower minor M t =1-.. 
Then, consider two cases. 
000 
(1) If A = 1, then the matrix is equal to } 0 1 0 |, and its rank, as is easy to see, 
002 


is equal to two. 
(2) If A ¥ 1, then we continue to compute the bordering minors: 


My3 = a= (1=HC = 2); 
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—100 
If 1 = 2, we obtain the matrix | 0 0 0 J, its rank is equal to two. 
001 


If A ~ 2, then we need to compute the bordering minor M@ a :M oo = 
(1 —A)Q—A)(3—-A). 


—2 0 0 
In the case when A = 3, the matrix is equal to | Q —1 Q |, its rank is equal 
0 00 


to two. Otherwise, the rank is equal to three. 
As a result, we form the answer:

$$\operatorname{rk} A = \begin{cases} 2, & \text{if } \lambda \in \{1, 2, 3\}, \\ 3, & \text{otherwise.} \end{cases}$$


(c) Since M Fi = | £0, then the rank of the matrix is no less than one. 
Then, consider the minors of the second order: 


i 2 
Mi3 = =4-). 
“Jat 2—a 


This minor is not equal to zero at A + 4. For this case, consider the bordering 
minor of the third order: 


The determinant is equal to zero for 4 = 2 + 3/2. For such values of A the 
rank is equal to two; for other values the rank is equal to three. 

Now consider the case 4 = 4. 

Then the non-zero minor of the second order: 


1,2 2 —3 
My = = 1440. 


2-2 10 


Then, the rank is no less than two. The third order minor is the only one, but 
it is only equal to zero for 4 = 2 + 3/2, as is shown above. Then, in this case 
the rank is equal to three. 
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So, forA = 2+ 3/2 the rank of the matrix is equal to two, for other A the 
rank of the matrix is equal to three. 
(d) There exists M3} = | £0, therefore, the rank of the matrix is no less than one. 
Then, consider the minors of the second order: 


1 
a SAGA): 


| ee 
sion re eae | 


This minor is not equal to zero at A # 0 and for A ¥ 1. For this case, such 
for the minor of the third order: 


-r~ 1 1 
1—-A 1 |=AQA—1)2-A). 
0 2-i2 


1,2,3 __ 
M)5°3 =| 0 
0 

AQ — 1)(2—A) £ 0 for aA ¥ 2 (since X ~ 1 and A F 0). In this case, the 
rank is equal to three. 


For 4 = 2 the rank is equal to two. 
Assume that now A = 0: 


Then, the rank is more than two. 


O11 
011;=0. 
002 


Then, in this case the rank is equal to two. 
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(e) 


Now consider A = 1: 


-111 
0 01 
0 01 
; 12 11 
The rank is equal to two, because M,’; = = |  O, and the 
; 01 


determinant of the matrix is equal to zero. 

Therefore, the rank is equal to two for 4 € {0, 1, 2}, and is equal to three in 
other cases. 
The minor of the first order is M + = | # 0, therefore, the rank of the matrix 
takes the value no less than one. 

Let us find the bordering minor of the second order: 


This minor is not equal to zero if A ~ —4. Let us find the minor of the third 
order for such values of A: 


1 -6 —-5 
1,2,3 2 
My53 =|-12-4 5 |=A°+8A+4+ 16. 
-1 6 1-d 
This expression may only be equal to zero for A = —4. Therefore, in this 
case the rank is equal to three. 
Now consider the case when A = —4. 
1 —6—-5 
-1 6°55 
-1 6°55 


The rank of the obtained matrix is equal to one. 
As a result, the rank is equal to one for 4 = —4, and is equal to three for 
AA —4. 
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(f) There exists M ‘i = | 40, therefore, the rank is no less than one. 
Let us find the minors of the second order. 


»_|l-a 2 
: tae 


My’ = A(A — 3). 


It is not equal to zero if A # O and A ¥ 3. Find for such values of the 
parameter A the bordering minors of the third order. 


t=%. 2 0 
Mi5'3 =| 1 2-a 0 |=AQ—3)GB—-A). 
0 oO 3-A 


The third order minor is non-zero for all considered in this case 2. Let us find 
the minor of the fourth order (it is the only one): 


1-’A 2 0 0 
1 2-A 0O 0 
0 0 3-A 
0 0 0 4-A 


== 37G2H4= 


It is equal to zero only for A = 4. For this case, the rank is equal to three. For 


2. # 4 the rank is equal to four. 
Then, consider the value A = 0: 


30 
3,4 
My, = = 1240, 
404 
300 
Mey = 030) = 2440. 
004 


For 24 = 0 the determinant of the initial matrix is equal to zero. Then, the 
rank is equal to three. 
For $\lambda = 3$:
—2 2 00 
1 -100 
0 000 
0 001 


There is no minor of the third order that is not equal to zero. Therefore, the 
rank is equal to two. 
Finally we obtain the rank is equal to two for A = 3, is equal to three for 
2 € {0, 4}, is equal to four in other cases. 
(g) There exists M i # 0, therefore, the rank is no less than one. 
Let us find the minors of the second order: 


1-2A 2 
1 l+a 


M3 = =-(7 +1) £0. 


This minor is always other than zero. Consider the bordering minors of the 
third order: 


baie eg «0 
1 1+a 0 | =(2—A)(—A? — 1) = Oonly fora = 2, but 
0 0 BHA 
ite ONG 
1 14a 0 |=(@+A)(—A? - 1) = 0 only for A = —2. 
0. 0 B44 


This is because the minors of the third order are not simultaneously equal to 
Zero. 

Calculate the determinant of the initial matrix (in other words, find the minor 
of the fourth order): 


1-A 2 0 0 
1 14+,a 0 0 
0 0 2-A 
0 0 O 2+A 


= 24002 =)EH = 1): 




For A = +2: 

(2+ A)(2 — A)(—A? — 1) = O, therefore, the rank is equal to three. 

For other A the rank is equal to four. 

So, the rank is equal to three for A = +2, and equal to four for A # +2. 
(h) There exists M ‘i + 0, therefore, the rank is no less than one. 

Let us find the minors of the second order: 


1+22 
Ms = i A ee Sal 


For this case, find the minors of the third order: 


1+a2 4 
Mix =| 1 1 0/=-Q-1) #0 
1,2,4 ~~ . 
0 0-1 


Then, the rank is greater than or equal to three. Calculate the determinant of 
the initial matrix: 


(RRO? 0. A 
i 8. OF Oil. ec 
SP 22212, 
0 02+4+A-1 
102 1 


The determinant is equal to zero for A = 4 or —3. Hence, for such values 


of the parameter 4 the rank is equal to three. In other cases the rank is equal to 
four. 


Now consider the case when A = 1. Let us find the minor of the second 
order: 
10 
2,3 
My = =340 
2,3 
03 


Calculate the minor of the third order: 
20 4 
MS =/10 0|/=1240. 
03-1 
Then, the rank of the matrix is no less than three. Now find the determinant
of the initial matrix.


220 4 
110 0 
003-1 
102 1 


=-1240. 


This implies that in this case the rank is equal to four. 

So, the rank is equal to three for A € {—3, 4}, and is equal to four in other 
cases. 
(i) The size of the matrix is equal to $(n + 1) \times (n + 1)$. As is easy to see, for $\lambda \in \{0, 1, \dots, n\}$ the determinant of the matrix takes the value equal to zero. We should also note that in this case there exists a minor of the order $n$ other than zero. For example, for $\lambda = 0$ such a minor is the determinant of an upper triangular matrix.

As is known, the determinant of an upper triangular matrix is equal to the product of the diagonal terms of the matrix. Since the condition $\lambda = 0$ is fulfilled, this product is not equal to zero. Therefore, the rank of the initial matrix in this case is equal to $n$. Yet, if the condition $\lambda \in \{0, 1, \dots, n\}$ is not fulfilled, then the rank of the initial matrix takes the value equal to $n + 1$.

We obtain the final answer: the rank of the matrix is equal to $n$ for $\lambda \in \{0, 1, \dots, n\}$, and is equal to $n + 1$ for other values of $\lambda$.


Chapter 3
Systems of Linear Equations


The system of m linear equations with n unknowns is written as 


$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = b_1, \\
a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = b_2, \\
\dots \\
a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n = b_m.
\end{cases}
\tag{3.1}
$$


Here, $x_1, x_2, \dots, x_n$ denote the unknown numbers, and $a_{ij}$ and $b_i$ are prescribed numbers, also referred to as coefficients of the system of equations (3.1). The numbers $b_i$ are called known terms or right-hand sides of the equations.

A solution of the system of equations is a collection of $n$ numbers $x_1, x_2, \dots, x_n$ which, when substituted into the system in place of the unknowns, turns all the equations into identities. The solution is written as a vector $[x_1, x_2, \dots, x_n]^T$.

If $b_i = 0$ for all $i = 1, 2, \dots, m$, then the system of equations is called homogeneous. If at least one of the known terms satisfies $b_i \neq 0$, then the system of equations is called non-homogeneous.

In the case when $m = n$, the system of equations is called square, and when $m \neq n$, the system is called rectangular.

A system of linear equations is called consistent, if it has at least one solution, 
and inconsistent, if there are no solutions [65]. 

If a consistent system has the only solution, it is called determined. If the 
consistent system has at least two different solutions, it is called undetermined. 
A matrix

$$
A = \begin{pmatrix}
a_{11} & a_{12} & \dots & a_{1n} \\
a_{21} & a_{22} & \dots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{m1} & a_{m2} & \dots & a_{mn}
\end{pmatrix}
\tag{3.2}
$$

consisting of the coefficients $a_{ij}$ of the unknowns is called a system matrix.
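A matrix

$$
(A \mid B) = \left(\begin{array}{cccc|c}
a_{11} & a_{12} & \dots & a_{1n} & b_1 \\
a_{21} & a_{22} & \dots & a_{2n} & b_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
a_{m1} & a_{m2} & \dots & a_{mn} & b_m
\end{array}\right)
\tag{3.3}
$$

into which a column of the constant terms $b_i$ is added is called an augmented system matrix.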


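If the unknowns and the constant terms are written in the form of columns (matrices of sizes $n \times 1$ and $m \times 1$, respectively):

$$
X = \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix}, \quad
B = \begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_m \end{pmatrix},
\tag{3.4}
$$

then the system of equations (3.1) can be presented in an abridged matrix form:

$$A \cdot X = B. \tag{3.5}$$
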
3.1 Cramer’s Rule 
Consider a square system of n equations with n unknowns: 
$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = b_1, \\
a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = b_2, \\
\dots \\
a_{n1}x_1 + a_{n2}x_2 + \cdots + a_{nn}x_n = b_n.
\end{cases}
\tag{3.6}
$$

Theorem 3.1 (Cramer’s¹ Rule) If the determinant $\Delta = \det A$ of the matrix $A$ of the system (3.6) is other than zero, then the system (3.6) is determined, i.e. it has the unique solution. This solution can be computed by the formula

$$x_i = \frac{\Delta_i}{\Delta}, \quad i = 1, 2, \dots, n. \tag{3.7}$$

Here, $\Delta_i$ is the determinant of the matrix obtained from the initial matrix $A$ by replacement of the $i$-th column with a column of constant terms:

$$
\Delta_i = \begin{vmatrix}
a_{11} & \dots & a_{1\,i-1} & b_1 & a_{1\,i+1} & \dots & a_{1n} \\
a_{21} & \dots & a_{2\,i-1} & b_2 & a_{2\,i+1} & \dots & a_{2n} \\
\vdots & & \vdots & \vdots & \vdots & & \vdots \\
a_{n1} & \dots & a_{n\,i-1} & b_n & a_{n\,i+1} & \dots & a_{nn}
\end{vmatrix}.
\tag{3.8}
$$

If $\Delta = 0$ and at least one of the determinants $\Delta_i$ is other than zero, then the system (3.6) has no solutions (i.e. it is inconsistent).

If $\Delta = 0$, but also all $\Delta_i = 0$, then the system (3.6) has infinitely many solutions (i.e. it is consistent but undetermined) or is inconsistent.


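Example 3.1 Solve the system:

$$
\begin{cases}
5x - 6y = -8, \\
5x + 6y = 28.
\end{cases} \tag{3.9}
$$

For the given system we have

$$
\Delta = \begin{vmatrix} 5 & -6 \\ 5 & 6 \end{vmatrix} = 60, \quad
\Delta_x = \begin{vmatrix} -8 & -6 \\ 28 & 6 \end{vmatrix} = 120, \quad
\Delta_y = \begin{vmatrix} 5 & -8 \\ 5 & 28 \end{vmatrix} = 180. \tag{3.10}
$$

Thus, according to Cramer's rule,

$$
x = \frac{\Delta_x}{\Delta} = \frac{120}{60} = 2, \quad
y = \frac{\Delta_y}{\Delta} = \frac{180}{60} = 3. \tag{3.11}
$$

¹ Gabriel Cramer (1704–1752) was a Genevan mathematician.
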
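Example 3.2 Solve the system of linear equations:

$$
\begin{cases}
2x - y + 3z = 8, \\
x + y - 2z = 5, \\
3x - 2y + z = 7.
\end{cases} \tag{3.12}
$$

Solution Compute the determinants required to apply Cramer's rule:

$$
\Delta = \begin{vmatrix} 2 & -1 & 3 \\ 1 & 1 & -2 \\ 3 & -2 & 1 \end{vmatrix} = -14, \quad
\Delta_x = \begin{vmatrix} 8 & -1 & 3 \\ 5 & 1 & -2 \\ 7 & -2 & 1 \end{vmatrix} = -56, \tag{3.13}
$$

$$
\Delta_y = \begin{vmatrix} 2 & 8 & 3 \\ 1 & 5 & -2 \\ 3 & 7 & 1 \end{vmatrix} = -42, \quad
\Delta_z = \begin{vmatrix} 2 & -1 & 8 \\ 1 & 1 & 5 \\ 3 & -2 & 7 \end{vmatrix} = -14. \tag{3.14}
$$

Therefore

$$
x = \frac{\Delta_x}{\Delta} = 4, \quad y = \frac{\Delta_y}{\Delta} = 3, \quad z = \frac{\Delta_z}{\Delta} = 1. \tag{3.15}
$$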
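
Formula (3.7) translates almost literally into code. The following is a minimal sketch rather than a reference implementation: the helper names `det` and `cramer` are illustrative assumptions, the determinant is computed by plain cofactor expansion (adequate only for small matrices), and exact rational arithmetic from the standard `fractions` module is used to avoid rounding.

```python
from fractions import Fraction


def det(M):
    """Determinant by cofactor expansion along the first row (small n only)."""
    n = len(M)
    if n == 1:
        return M[0][0]
    total = 0
    for j in range(n):
        # Minor: delete the first row and the j-th column.
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * M[0][j] * det(minor)
    return total


def cramer(A, b):
    """Solve a square system A x = b by Cramer's rule; None if det A = 0."""
    A = [[Fraction(v) for v in row] for row in A]
    b = [Fraction(v) for v in b]
    d = det(A)
    if d == 0:
        return None  # formula (3.7) is not applicable
    solution = []
    for i in range(len(A)):
        # Replace the i-th column of A with the column of constant terms.
        A_i = [row[:i] + [b[k]] + row[i + 1:] for k, row in enumerate(A)]
        solution.append(det(A_i) / d)
    return solution


# Systems (3.9) and (3.12) from Examples 3.1 and 3.2
print(cramer([[5, -6], [5, 6]], [-8, 28]))
print(cramer([[2, -1, 3], [1, 1, -2], [3, -2, 1]], [8, 5, 7]))
```

The two calls reproduce the solutions found in Examples 3.1 and 3.2. Since every unknown requires its own determinant, this approach is practical only for small systems.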


3.2 Inverse Matrix Method 


Consider the system of equations (3.6). Write this system in the form $A \cdot X = B$ in
accordance with (3.5).

Let the matrix $A$ have an inverse $A^{-1}$. Multiply both sides of the equality
$A \cdot X = B$ by $A^{-1}$ on the left:

$$
A^{-1} \cdot A \cdot X = A^{-1} \cdot B. \tag{3.16}
$$

Transform the obtained matrix equation. Since the identities $A^{-1} \cdot A = I$ and
$I \cdot X = X$ hold, the solution $X$ of the system can be written in the form

$$
X = A^{-1} \cdot B. \tag{3.17}
$$

Therefore, if we find the inverse matrix $A^{-1}$, then the solution of the system can
be obtained as the product of the matrices $A^{-1}$ and $B$.

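Example 3.3 Solve the system of equations

$$
\begin{cases}
-2x + 2y - 3z = -10, \\
2x - y + 2z = 7, \\
3x - y + 3z = 10
\end{cases} \tag{3.18}
$$

by the inverse matrix method.

Solution The matrix $A$ of the system has the form:

$$
A = \begin{bmatrix} -2 & 2 & -3 \\ 2 & -1 & 2 \\ 3 & -1 & 3 \end{bmatrix}. \tag{3.19}
$$

We find the inverse matrix using the cofactor matrix. The determinant of the matrix
$A$ is equal to

$$
\det A = -2 \begin{vmatrix} -1 & 2 \\ -1 & 3 \end{vmatrix}
- 2 \begin{vmatrix} 2 & 2 \\ 3 & 3 \end{vmatrix}
- 3 \begin{vmatrix} 2 & -1 \\ 3 & -1 \end{vmatrix}
= (-2)(-3 + 2) - 2(6 - 6) - 3(-2 + 3) = -1. \tag{3.20}
$$

Calculate the cofactors:

$$
A_{11} = (-1)^{1+1} \begin{vmatrix} -1 & 2 \\ -1 & 3 \end{vmatrix} = -1, \quad
A_{12} = (-1)^{1+2} \begin{vmatrix} 2 & 2 \\ 3 & 3 \end{vmatrix} = 0, \tag{3.21}
$$

$$
A_{13} = (-1)^{1+3} \begin{vmatrix} 2 & -1 \\ 3 & -1 \end{vmatrix} = 1, \quad
A_{21} = (-1)^{2+1} \begin{vmatrix} 2 & -3 \\ -1 & 3 \end{vmatrix} = -3, \tag{3.22}
$$

$$
A_{22} = (-1)^{2+2} \begin{vmatrix} -2 & -3 \\ 3 & 3 \end{vmatrix} = 3, \quad
A_{23} = (-1)^{2+3} \begin{vmatrix} -2 & 2 \\ 3 & -1 \end{vmatrix} = 4, \tag{3.23}
$$

$$
A_{31} = (-1)^{3+1} \begin{vmatrix} 2 & -3 \\ -1 & 2 \end{vmatrix} = 1, \quad
A_{32} = (-1)^{3+2} \begin{vmatrix} -2 & -3 \\ 2 & 2 \end{vmatrix} = -2, \tag{3.24}
$$

$$
A_{33} = (-1)^{3+3} \begin{vmatrix} -2 & 2 \\ 2 & -1 \end{vmatrix} = -2. \tag{3.25}
$$

Write the matrix formed by the cofactors:

$$
\begin{bmatrix} -1 & 0 & 1 \\ -3 & 3 & 4 \\ 1 & -2 & -2 \end{bmatrix}. \tag{3.26}
$$

As a result, the desired inverse matrix is obtained by transposing the cofactor
matrix and dividing it by $\det A$:

$$
A^{-1} = \frac{1}{(-1)} \begin{bmatrix} -1 & -3 & 1 \\ 0 & 3 & -2 \\ 1 & 4 & -2 \end{bmatrix}
= \begin{bmatrix} 1 & 3 & -1 \\ 0 & -3 & 2 \\ -1 & -4 & 2 \end{bmatrix}. \tag{3.27}
$$

Then, the solution of the system is found by matrix multiplication:

$$
\begin{bmatrix} x \\ y \\ z \end{bmatrix}
= \begin{bmatrix} 1 & 3 & -1 \\ 0 & -3 & 2 \\ -1 & -4 & 2 \end{bmatrix}
\begin{bmatrix} -10 \\ 7 \\ 10 \end{bmatrix}
= \begin{bmatrix} -10 + 21 - 10 \\ 0 - 21 + 20 \\ 10 - 28 + 20 \end{bmatrix}
= \begin{bmatrix} 1 \\ -1 \\ 2 \end{bmatrix}. \tag{3.28}
$$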
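
The steps of Example 3.3 (determinant, cofactors, transposition, multiplication by the column of constant terms) can be sketched in Python as follows. The function names `det`, `inverse` and `mat_vec` are illustrative assumptions, and the cofactor approach is shown only because it mirrors the text; it is not an efficient way to invert large matrices.

```python
from fractions import Fraction


def det(M):
    """Determinant by cofactor expansion along the first row."""
    n = len(M)
    if n == 0:
        return 1  # determinant of the empty matrix, needed for 1 x 1 inputs
    if n == 1:
        return M[0][0]
    return sum((-1) ** j * M[0][j] * det([r[:j] + r[j + 1:] for r in M[1:]])
               for j in range(n))


def inverse(A):
    """Inverse through the cofactor matrix; None for a degenerate matrix."""
    n = len(A)
    A = [[Fraction(v) for v in row] for row in A]
    d = det(A)
    if d == 0:
        return None

    def cofactor(i, j):
        minor = [r[:j] + r[j + 1:] for k, r in enumerate(A) if k != i]
        return (-1) ** (i + j) * det(minor)

    # Entry (i, j) of the inverse is the cofactor A_ji divided by det A.
    return [[cofactor(j, i) / d for j in range(n)] for i in range(n)]


def mat_vec(M, v):
    """Product of a matrix and a column given as a flat list."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


A = [[-2, 2, -3], [2, -1, 2], [3, -1, 3]]   # matrix (3.19)
B = [-10, 7, 10]                            # constant terms of (3.18)

A_inv = inverse(A)
X = mat_vec(A_inv, B)
print(A_inv)
print(X)
```

For the data of Example 3.3 the computed inverse coincides with (3.27) and the product with (3.28).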


3.3 Gaussian Elimination


The notion of matrix rank yields a consistency criterion for a system of
linear equations.
Consider the system:

$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \ldots + a_{1n}x_n = b_1, \\
a_{21}x_1 + a_{22}x_2 + \ldots + a_{2n}x_n = b_2, \\
\ldots \\
a_{m1}x_1 + a_{m2}x_2 + \ldots + a_{mn}x_n = b_m.
\end{cases} \tag{3.29}
$$

The augmented matrix of the system, of size $m \times (n + 1)$, has the form:

$$
B = \left[\begin{array}{cccc|c}
a_{11} & a_{12} & \ldots & a_{1n} & b_1 \\
a_{21} & a_{22} & \ldots & a_{2n} & b_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
a_{m1} & a_{m2} & \ldots & a_{mn} & b_m
\end{array}\right]. \tag{3.30}
$$

The consistency criterion for a system of linear algebraic equations is the
Kronecker–Capelli² theorem (also referred to as the Rouché³–Capelli theorem)
[64].


Theorem 3.2 (Kronecker–Capelli Theorem) A system of linear algebraic equations
(3.29) is consistent if and only if the rank of the basic matrix $A$ equals the
rank of the augmented one, i.e. $\operatorname{rk} A = \operatorname{rk} B = r$.


If $r = n$, then the system reduces to a square system whose matrix has a non-zero
determinant. Its solution exists and is unique.

If $r < n$ and the system is consistent, then there exists an infinite set of solutions.

If $\operatorname{rk} B > \operatorname{rk} A$, then the system is inconsistent.
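
The three cases above can be checked mechanically by comparing ranks. The sketch below is an illustration under stated assumptions: the names `rank` and `classify` are hypothetical, and the rank is found by row reduction in exact rational arithmetic so that no tolerance is needed.

```python
from fractions import Fraction


def rank(M):
    """Rank of a matrix found by row reduction with exact fractions."""
    M = [[Fraction(v) for v in row] for row in M]
    rows, cols = len(M), len(M[0])
    r = 0  # number of pivot rows found so far
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if pivot is None:
            continue
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, rows):
            factor = M[i][c] / M[r][c]
            M[i] = [a - factor * b for a, b in zip(M[i], M[r])]
        r += 1
        if r == rows:
            break
    return r


def classify(A, b):
    """Apply the Kronecker-Capelli criterion to the system A x = b."""
    n = len(A[0])
    augmented = [list(row) + [bi] for row, bi in zip(A, b)]
    r_a, r_b = rank(A), rank(augmented)
    if r_a != r_b:
        return "inconsistent"
    return "determined" if r_a == n else "undetermined"


print(classify([[2, -1, 3], [1, 1, -2], [3, -2, 1]], [8, 5, 7]))
print(classify([[1, 1], [1, 1]], [1, 2]))
```

The first call uses system (3.12), which has a unique solution; the second system, $x + y = 1$, $x + y = 2$, is an illustrative inconsistent one.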

In what follows, by zero equations we mean equations of the form
$0 \cdot x_1 + 0 \cdot x_2 + \ldots + 0 \cdot x_n = 0$.

Elementary transformations of a system of linear equations are:

1. interchange (swap) of any two equations;
2. multiplying an equation by any non-zero number;
3. adding to an equation another one multiplied by an arbitrary number;
4. dropping the zero equations.


The Gaussian⁴ method, or Gaussian elimination, consists in transforming the
augmented matrix $B$ to echelon form by means of elementary transformations.
Such transformations are aimed at obtaining a system of the form:

$$
\begin{cases}
b_{11}x_1 + b_{12}x_2 + \ldots + b_{1r}x_r + \ldots + b_{1n}x_n = \beta_1, \\
\phantom{b_{11}x_1 + {}} b_{22}x_2 + \ldots + b_{2r}x_r + \ldots + b_{2n}x_n = \beta_2, \\
\ldots \\
\phantom{b_{11}x_1 + b_{12}x_2 + \ldots + {}} b_{rr}x_r + \ldots + b_{rn}x_n = \beta_r.
\end{cases} \tag{3.31}
$$

Note that bringing the system of equations to this form may require
renumbering the variables.


² Alfredo Capelli (1855–1910), Italian mathematician.
³ Eugène Rouché (1832–1910), French mathematician.
⁴ Johann Carl Friedrich Gauß (1777–1855), prominent German mathematician and astronomer.
The system (3.31) may contain fewer equations than the initial one because the
zero equations of the form

$$
0 \cdot x_1 + 0 \cdot x_2 + \ldots + 0 \cdot x_n = 0 \tag{3.32}
$$

are dropped. If the transformations result in an equation

$$
0 \cdot x_1 + 0 \cdot x_2 + \ldots + 0 \cdot x_n = d \neq 0, \tag{3.33}
$$

then the system is inconsistent.
Clearly, the minor $M_{1,2,\ldots,r}^{1,2,\ldots,r}$ is a basic minor, and $x_{r+1}, \ldots, x_n$
are free unknowns, to which we can assign arbitrary values:

$$
x_{r+1} = C_1, \; \ldots, \; x_n = C_{n-r}. \tag{3.34}
$$

Move these variables to the right-hand side; the obtained system then necessarily
has a solution for the unknowns $x_1, x_2, \ldots, x_r$.
From the last equation we find $x_r$, from the last but one we find $x_{r-1}$, etc.


Note The Gaussian method is sometimes referred to as the method of successive
elimination of unknowns.


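Example 3.4 Solve the system of equations by the Gaussian method:

$$
\begin{cases}
x_1 - 2x_2 + x_4 = -3, \\
3x_1 - x_2 - 2x_3 = 1, \\
2x_1 + x_2 - 2x_3 - x_4 = 4, \\
x_1 + 3x_2 - 2x_3 - 2x_4 = 7.
\end{cases} \tag{3.35}
$$

Solution Write the augmented matrix:

$$
\left[\begin{array}{rrrr|r}
1 & -2 & 0 & 1 & -3 \\
3 & -1 & -2 & 0 & 1 \\
2 & 1 & -2 & -1 & 4 \\
1 & 3 & -2 & -2 & 7
\end{array}\right]. \tag{3.36}
$$

Bring the obtained matrix to a triangular form. For this, subtract from the second
row the tripled first row, from the third row the doubled first one, and from the
fourth row the first one:

$$
\left[\begin{array}{rrrr|r}
1 & -2 & 0 & 1 & -3 \\
0 & 5 & -2 & -3 & 10 \\
0 & 5 & -2 & -3 & 10 \\
0 & 5 & -2 & -3 & 10
\end{array}\right]. \tag{3.37}
$$

In the next step, subtract the second row from the third and the fourth rows. The
first and the second rows remain unchanged:

$$
\left[\begin{array}{rrrr|r}
1 & -2 & 0 & 1 & -3 \\
0 & 5 & -2 & -3 & 10 \\
0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0
\end{array}\right]. \tag{3.38}
$$

It is clear that the ranks of the basic and the augmented matrices are equal to two.
The system is consistent and undetermined. As free variables, take $x_1$ and $x_3$. Let
$x_1 = C_1$ and $x_3 = C_2$. Then we have

$$
\begin{cases}
-2 \cdot x_2 + x_4 = -3 - C_1, \\
5 \cdot x_2 - 3 \cdot x_4 = 10 + 2 \cdot C_2.
\end{cases} \tag{3.39}
$$

Solving this system for the variables $x_2$ and $x_4$, we obtain

$$
\begin{cases}
x_2 = -1 + 3 \cdot C_1 - 2 \cdot C_2, \\
x_4 = -5 + 5 \cdot C_1 - 4 \cdot C_2.
\end{cases} \tag{3.40}
$$

We write the final answer in the form:

$$
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix}
= \begin{bmatrix} C_1 \\ -1 + 3C_1 - 2C_2 \\ C_2 \\ -5 + 5C_1 - 4C_2 \end{bmatrix}
= \begin{bmatrix} 0 \\ -1 \\ 0 \\ -5 \end{bmatrix}
+ C_1 \begin{bmatrix} 1 \\ 3 \\ 0 \\ 5 \end{bmatrix}
+ C_2 \begin{bmatrix} 0 \\ -2 \\ 1 \\ -4 \end{bmatrix}. \tag{3.41}
$$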
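
A parametric answer such as (3.41) is easy to check by substitution. The sketch below assumes only the data of Example 3.4; the helper names `solution` and `residual` and the sample values of $C_1$, $C_2$ are illustrative choices.

```python
from fractions import Fraction
from itertools import product

# Coefficients and constant terms of system (3.35)
A = [[1, -2, 0, 1],
     [3, -1, -2, 0],
     [2, 1, -2, -1],
     [1, 3, -2, -2]]
b = [-3, 1, 4, 7]


def solution(c1, c2):
    """General solution (3.41) for given values of the free variables."""
    return [c1, -1 + 3 * c1 - 2 * c2, c2, -5 + 5 * c1 - 4 * c2]


def residual(x):
    """Left-hand sides minus right-hand sides for a candidate solution x."""
    return [sum(a * v for a, v in zip(row, x)) - bi for row, bi in zip(A, b)]


# Try several (C1, C2) pairs, including a non-integer value.
samples = [Fraction(-3, 2), 0, 1, 7]
checks = [residual(solution(c1, c2)) for c1, c2 in product(samples, repeat=2)]
print(all(r == [0, 0, 0, 0] for r in checks))
```

Passing the check for a handful of parameter values does not prove the formula, but because both sides are linear in $C_1$ and $C_2$, three affinely independent parameter pairs already determine the result.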
Example 3.5 Solve the matrix equation for the unknown matrix $Y$:

$$
\begin{bmatrix} 1 & -1 \\ -1 & 2 \end{bmatrix} \cdot Y \cdot
\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}
= \begin{bmatrix} -2 & -6 \\ 3 & 9 \end{bmatrix}. \tag{3.42}
$$

Note the degeneracy of one of the factors, namely the matrix
$\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}$.

Solution The degeneracy of this matrix does not allow us to solve the
equation with the help of the inverse matrix. In this case, we reduce the
problem to a system of linear equations.
Denote the elements of the matrix $Y$ by $y_1$, $y_2$, $y_3$ and $y_4$:

$$
Y = \begin{bmatrix} y_1 & y_2 \\ y_3 & y_4 \end{bmatrix}. \tag{3.43}
$$

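Successively expand the product of the matrices:

$$
\begin{bmatrix} 1 & -1 \\ -1 & 2 \end{bmatrix}
\begin{bmatrix} y_1 & y_2 \\ y_3 & y_4 \end{bmatrix}
\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}
= \begin{bmatrix} y_1 - y_3 & y_2 - y_4 \\ -y_1 + 2y_3 & -y_2 + 2y_4 \end{bmatrix}
\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}
$$

$$
= \begin{bmatrix}
y_1 + y_2 - y_3 - y_4 & 3y_1 + 3y_2 - 3y_3 - 3y_4 \\
-y_1 - y_2 + 2y_3 + 2y_4 & -3y_1 - 3y_2 + 6y_3 + 6y_4
\end{bmatrix}. \tag{3.44}
$$

The obtained matrix is equal to $\begin{bmatrix} -2 & -6 \\ 3 & 9 \end{bmatrix}$.

Equating the respective elements of the matrices, we obtain a non-homogeneous
system of linear equations in the unknowns $y_1$, $y_2$, $y_3$ and $y_4$:

$$
\begin{cases}
y_1 + y_2 - y_3 - y_4 = -2, \\
3y_1 + 3y_2 - 3y_3 - 3y_4 = -6, \\
-y_1 - y_2 + 2y_3 + 2y_4 = 3, \\
-3y_1 - 3y_2 + 6y_3 + 6y_4 = 9.
\end{cases} \tag{3.45}
$$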
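Write the augmented matrix of this system:

$$
\left[\begin{array}{rrrr|r}
1 & 1 & -1 & -1 & -2 \\
3 & 3 & -3 & -3 & -6 \\
-1 & -1 & 2 & 2 & 3 \\
-3 & -3 & 6 & 6 & 9
\end{array}\right]. \tag{3.46}
$$

Subtract from the second row the first row multiplied by 3; then subtract from
the fourth row the third row multiplied by 3:

$$
\left[\begin{array}{rrrr|r}
1 & 1 & -1 & -1 & -2 \\
0 & 0 & 0 & 0 & 0 \\
-1 & -1 & 2 & 2 & 3 \\
0 & 0 & 0 & 0 & 0
\end{array}\right]. \tag{3.47}
$$

Then add the first row to the third row. The zero rows do not influence the
solution, so we obtain the following equivalent matrix:

$$
\left[\begin{array}{rrrr|r}
1 & 1 & -1 & -1 & -2 \\
0 & 0 & 1 & 1 & 1
\end{array}\right]. \tag{3.48}
$$

It corresponds to the system of equations with four unknowns

$$
\begin{cases}
y_1 + y_2 - y_3 - y_4 = -2, \\
y_3 + y_4 = 1.
\end{cases} \tag{3.49}
$$

The ranks of the basic and augmented matrices are equal to two. Therefore, we
conclude that the system (3.49) is consistent and undetermined.
As the independent variables, select, for example, $y_2$ and $y_4$:

$$
y_2 = C_1, \quad y_4 = C_2, \quad \text{where } C_1, C_2 \in \mathbb{R}. \tag{3.50}
$$

Having substituted (3.50) into the system (3.49), we obtain

$$
\begin{cases}
y_1 + C_1 - y_3 - C_2 = -2, \\
y_3 + C_2 = 1,
\end{cases} \tag{3.51}
$$

or

$$
\begin{cases}
y_1 = -1 - C_1, \\
y_3 = 1 - C_2.
\end{cases} \tag{3.52}
$$

Write the unknowns $y_1, \ldots, y_4$ in the form of a column:

$$
\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix}
= \begin{bmatrix} -1 \\ 0 \\ 1 \\ 0 \end{bmatrix}
+ C_1 \begin{bmatrix} -1 \\ 1 \\ 0 \\ 0 \end{bmatrix}
+ C_2 \begin{bmatrix} 0 \\ 0 \\ -1 \\ 1 \end{bmatrix}. \tag{3.53}
$$

As a result, the matrix $Y$ is equal to

$$
Y = \begin{bmatrix} -1 - C_1 & C_1 \\ 1 - C_2 & C_2 \end{bmatrix}, \tag{3.54}
$$

where $C_1$, $C_2$ are arbitrary real numbers.
Thus, the matrix equation (3.42) has an infinite set of solutions depending on
two real parameters.


Check
By direct substitution of the obtained matrix into (3.42) it is easy to check that $Y$ is
determined correctly:

$$
\begin{bmatrix} 1 & -1 \\ -1 & 2 \end{bmatrix}
\begin{bmatrix} -1 - C_1 & C_1 \\ 1 - C_2 & C_2 \end{bmatrix}
\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}
= \begin{bmatrix} -2 - C_1 + C_2 & C_1 - C_2 \\ 3 + C_1 - 2C_2 & -C_1 + 2C_2 \end{bmatrix}
\begin{bmatrix} 1 & 3 \\ 1 & 3 \end{bmatrix}
= \begin{bmatrix} -2 & -6 \\ 3 & 9 \end{bmatrix}.
$$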
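
The same check can be repeated numerically for many parameter values. This is a small sketch under the assumption that the three matrices are exactly those appearing in the check above; `mat_mul` and `Y` are illustrative helper names.

```python
def mat_mul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


LEFT = [[1, -1], [-1, 2]]
RIGHT = [[1, 3], [1, 3]]      # the degenerate factor
TARGET = [[-2, -6], [3, 9]]


def Y(c1, c2):
    """Solution family (3.54)."""
    return [[-1 - c1, c1], [1 - c2, c2]]


ok = all(mat_mul(mat_mul(LEFT, Y(c1, c2)), RIGHT) == TARGET
         for c1 in range(-3, 4) for c2 in range(-3, 4))
print(ok)
```

Integer parameters are used so that the comparison with `TARGET` is exact.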


Assume that $A$ and $B$ are arbitrary matrices of sizes $m \times n$ and $n \times p$,
respectively. An estimate of the rank of the product of the matrices $A$ and $B$ is
provided by the following theorem.


Theorem 3.3 The rank of the matrix product satisfies the inequality

$$
\operatorname{rk} AB \leq \min(\operatorname{rk} A, \operatorname{rk} B). \tag{3.55}
$$

In other words, the rank cannot increase when the matrices are multiplied [63,
64].

There are many methods of solving systems of linear equations.
The Gaussian method is one of the most frequently used.

Consider a Python implementation of the Gaussian method for solving a system of
linear equations (Listing 3.1).

For definiteness, we construct solutions for systems in which the number
of equations coincides with the number of unknowns, but the algorithm can easily be
adapted to systems with an arbitrary relation between the numbers of equations and
unknowns.


Listing 3.1

    import math

    EPS = 1e-12  # absolute tolerance used when testing a value for zero


    def gaussian_elimination(A, B):
        m = len(A)       # number of equations
        n = len(A[0])    # number of unknowns

        if len(B) != m:
            raise ValueError

        # Augmented matrix C = [A | B]
        C = [[A[i][j] if j != n else B[i]
              for j in range(n + 1)] for i in range(m)]

        # Forward elimination
        for r in range(min(n, m)):
            max_row_pos = r

            # Pivoting strategy: largest absolute value in column r
            for i in range(r + 1, m):
                if abs(C[i][r]) > abs(C[max_row_pos][r]):
                    max_row_pos = i

            C[r], C[max_row_pos] = C[max_row_pos], C[r]

            if math.isclose(C[r][r], 0, abs_tol=EPS):
                continue

            for i in range(r + 1, m):
                factor = C[i][r] / C[r][r]

                for j in range(r, n + 1):
                    C[i][j] -= factor * C[r][j]

        # Back substitution
        answer = [0] * n

        for i in range(min(n - 1, m - 1), -1, -1):
            s = 0.0

            for j in range(i + 1, n):
                s += C[i][j] * answer[j]

            if not math.isclose(C[i][i], 0, abs_tol=EPS):
                answer[i] = (C[i][n] - s) / C[i][i]
            elif not math.isclose(C[i][n] - s, 0, abs_tol=EPS):
                return None

        # Extra equations (m > n) must hold for the values found
        for i in range(n, m):
            s = 0.0

            for j in range(n):
                s += C[i][j] * answer[j]

            if not math.isclose(C[i][n] - s, 0, abs_tol=EPS):
                return None

        return answer


The function takes two arguments: the coefficient matrix A of the system of
equations and the column of constant terms B.

The implementation consists of three main steps. In the first step, the
matrix C is constructed by appending the column B to the initial
matrix A on the right.

In the second step, the so-called forward pass brings the
matrix to echelon form (that is, to a form in which each successive row, read
from left to right, starts with more zeros than the previous one). This is done
by applying a series of elementary transformations to the matrix C according to
the following algorithm. The columns are scanned successively, starting from the
first one. Among the elements of the current column that lie in the current row
or below it, the one with the greatest absolute value is found; it is referred to
as the pivot, and its row is swapped into the current position. Then, from each
row below, the pivot row is subtracted, multiplied by the coefficient equal to
the ratio of that row's element in the current column to the pivot. Thus, all
the elements of the column below the pivot become equal to zero. For a system
with a unique solution, the process ends when the last row contains only two
non-zero entries: one coefficient of an unknown and one value on the right-hand side.

In the third step, the "backward pass" is performed. The values of the unknowns
are successively expressed in terms of the already found variables. In this way,
the unique solution is obtained, or it is established that there are no solutions
or that the set of solutions is infinite. The backward pass starts from the row
containing the minimum number of non-zero coefficients and continues until all
the unknowns are expressed in terms of the already known ones, or until it is
established that there is no unique solution.

The asymptotic complexity of Gaussian elimination, due to the triple loop nesting
over the variables r, i, j, is equal to $O(n^3)$, where $n$ is the number of equations in
the system.
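
A remark on the zero tests used in such code. With its default arguments, `math.isclose` applies only a relative tolerance, so a comparison with zero succeeds only for an exact zero; an absolute tolerance has to be passed explicitly for rounding residues to be recognised. The threshold `1e-12` below is an illustrative assumption and should be chosen to match the scale of the data.

```python
import math

# A typical rounding residue left where an exact zero is expected.
tiny = 1e-17

# Default call: relative tolerance only, so a non-zero value is never
# "close" to zero, however small it is.
print(math.isclose(tiny, 0))                  # False

# With an absolute tolerance the residue is treated as zero.
print(math.isclose(tiny, 0, abs_tol=1e-12))   # True

# An exact zero passes in both cases.
print(math.isclose(0.0, 0))                   # True
```

In elimination code this distinction decides whether a nearly vanishing pivot is skipped or used as a divisor.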

Let us give an example (see Listing 3.2) of using the function
gaussian_elimination(A, B) to solve a system of equations whose
matrix of size 100 x 100 has ones on the secondary diagonal and all other elements
equal to zero. The column B is equal to [1, 2, 3, 4, ..., 100].


Listing 3.2

    size = 100

    A = [[0 for j in range(size)]
         for i in range(size)]
    B = [0 for i in range(size)]

    # Ones on the secondary diagonal
    for i in range(size):
        for j in range(size):
            A[i][j] = 1 if j == size - i - 1 else 0

    # Right-hand side 1, 2, ..., 100
    for i in range(size):
        B[i] = float(i + 1)

    print(gaussian_elimination(A, B))
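
For this particular system the answer can be predicted without elimination: row $i$ contains a single one in column $100 - i + 1$, so each equation fixes one unknown and the solution is the column B written in reverse order. The sketch below checks this directly, assuming the right-hand side [1, 2, ..., 100] stated in the text; it does not call the elimination routine.

```python
size = 100

# Ones on the secondary diagonal, zeros elsewhere.
A = [[1 if j == size - 1 - i else 0 for j in range(size)]
     for i in range(size)]
B = [float(i + 1) for i in range(size)]

# Equation i reads x[size - 1 - i] = B[i], hence x is B reversed.
x = B[::-1]

# Largest deviation between A x and B over all equations.
residual = max(abs(sum(A[i][j] * x[j] for j in range(size)) - B[i])
               for i in range(size))
print(x[:3], x[-3:], residual)
```

A system of this kind is a convenient test of the pivoting step, because every diagonal element is initially zero and each step of the forward pass has to swap rows.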


3.4 Fundamental System of Solutions of Homogeneous 
Systems 


Consider a homogeneous system of equations that has the form

$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \ldots + a_{1n}x_n = 0, \\
a_{21}x_1 + a_{22}x_2 + \ldots + a_{2n}x_n = 0, \\
\ldots \\
a_{m1}x_1 + a_{m2}x_2 + \ldots + a_{mn}x_n = 0,
\end{cases} \tag{3.56}
$$

where $m$ is the number of equations and $n$ is the number of unknowns.
This system can be written in a matrix form:

$$
A \cdot X = 0, \tag{3.57}
$$

where $A$ is the system matrix and $X$ denotes the column formed by the
variables:

$$
X = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}. \tag{3.58}
$$

This system necessarily has the solution $X = [0, 0, \ldots, 0]^T$, which is called
trivial.
Our purpose is to find all non-trivial solutions, if any. 


Theorem 3.4 Let the matrix $A$ of a homogeneous system of equations have the size
$m \times n$ and the rank $r$. If $r = n$, then the system has only the trivial solution. If
$r < n$, then there exist exactly $n - r$ linearly independent solutions, which form a
fundamental system of solutions.


Suppose that we have found the basic minor and it is located in the upper left
corner (otherwise, the order of variables and equations may be changed). Let us
keep only those equations whose coefficients are included into the basic minor, that
is, from the first to the $r$-th. The unknowns from number $r + 1$ to number $n$ are
referred to as free and moved to the right-hand side of the equations:

$$
\begin{cases}
a_{11}x_1 + \ldots + a_{1r}x_r = -a_{1,r+1}x_{r+1} - \ldots - a_{1n}x_n, \\
\ldots \\
a_{r1}x_1 + \ldots + a_{rr}x_r = -a_{r,r+1}x_{r+1} - \ldots - a_{rn}x_n.
\end{cases} \tag{3.59}
$$

Let us introduce the square matrix

$$
C = \begin{bmatrix}
a_{11} & \ldots & a_{1r} \\
\vdots & \ddots & \vdots \\
a_{r1} & \ldots & a_{rr}
\end{bmatrix}; \tag{3.60}
$$

according to the property of a basic minor, the inequality $\det C \neq 0$
is valid.
Denote

$$
Y = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_r \end{bmatrix} \in \mathbb{R}^{r}, \quad
Z = \begin{bmatrix} x_{r+1} \\ x_{r+2} \\ \vdots \\ x_n \end{bmatrix} \in \mathbb{R}^{n-r}. \tag{3.61}
$$

Then, select $n - r$ linearly independent vectors $Z_1, Z_2, \ldots, Z_{n-r}$. Usually,
the vectors are taken that form the so-called canonical basis in $\mathbb{R}^{n-r}$:

$$
Z_1 = \begin{bmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{bmatrix}, \quad
Z_2 = \begin{bmatrix} 0 \\ 1 \\ \vdots \\ 0 \end{bmatrix}, \quad \ldots, \quad
Z_{n-r} = \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 1 \end{bmatrix}. \tag{3.62}
$$

Let us denote by $F_1$ the vector on the right-hand side of the equations for the given $Z_1$, ...,
and by $F_{n-r}$ the one for $Z_{n-r}$.
In this case, we have the systems of equations in a matrix form:

$$
\begin{cases}
C \cdot Y_1 = F_1, \\
\ldots \\
C \cdot Y_{n-r} = F_{n-r}.
\end{cases} \tag{3.63}
$$

According to Cramer's rule, the solutions $Y_1, Y_2, \ldots, Y_{n-r}$ are uniquely
determined. Then the full solution of the system will consist of the vectors

$$
X_1 = \begin{bmatrix} Y_1 \\ Z_1 \end{bmatrix}, \quad \ldots, \quad
X_{n-r} = \begin{bmatrix} Y_{n-r} \\ Z_{n-r} \end{bmatrix}, \tag{3.64}
$$

which are linearly independent, since $Z_1, \ldots, Z_{n-r}$ are linearly independent.
The set of the solutions $X_1, X_2, \ldots, X_{n-r}$ represents the fundamental system
of solutions (FSS).

If the system has only one free unknown, then we assign it the value
one. If there are no free unknowns, i.e. $r = n$, then such a system has only
the trivial solution and therefore there are no fundamental solutions.

We will denote the general solution of the homogeneous system by $X_{\text{gen.}}$:

$$
X_{\text{gen.}} = C_1 X_1 + C_2 X_2 + \ldots + C_{n-r} X_{n-r}, \tag{3.65}
$$

where $C_1, C_2, \ldots, C_{n-r}$ are arbitrary constants.
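
The construction described above can be sketched in Python. The version below is an illustration under stated assumptions, not the text's own algorithm: instead of solving each system (3.63) by Cramer's rule, it reduces the matrix to reduced echelon form, which solves all of them at once; the free unknowns are simply the non-pivot columns, and the function name `fundamental_system` is hypothetical.

```python
from fractions import Fraction


def fundamental_system(A):
    """Basis of solutions of A x = 0, one vector per free unknown."""
    M = [[Fraction(v) for v in row] for row in A]
    rows, n = len(M), len(M[0])
    pivots = []   # pivots[k] is the pivot column of row k
    r = 0
    for c in range(n):
        p = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if p is None:
            continue  # column c belongs to a free unknown
        M[r], M[p] = M[p], M[r]
        M[r] = [v / M[r][c] for v in M[r]]       # make the pivot equal to 1
        for i in range(rows):
            if i != r and M[i][c] != 0:
                f = M[i][c]
                M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
        if r == rows:
            break
    free = [c for c in range(n) if c not in pivots]
    basis = []
    for f in free:
        x = [Fraction(0)] * n
        x[f] = Fraction(1)           # canonical choice (3.62) for free unknowns
        for row, c in zip(M, pivots):
            x[c] = -row[f]           # basic unknown read off its own row
        basis.append(x)
    return basis


# Matrix of the homogeneous system (3.66)
A = [[2, -4, 5, 3], [3, -6, 4, 2], [4, -8, 17, 11]]
basis = fundamental_system(A)
for v in basis:
    print(v)
```

For the matrix of system (3.66) the sketch returns two vectors, in agreement with $n - r = 4 - 2$. They differ from the vectors obtained by hand when other unknowns are taken as free, since a fundamental system of solutions is not unique, but they span the same set of solutions.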


Example 3.6 Find the fundamental system of solutions for the homogeneous
system of equations

$$
\begin{cases}
2x_1 - 4x_2 + 5x_3 + 3x_4 = 0, \\
3x_1 - 6x_2 + 4x_3 + 2x_4 = 0, \\
4x_1 - 8x_2 + 17x_3 + 11x_4 = 0.
\end{cases} \tag{3.66}
$$

Solution Find the rank of the matrix of the given system

$$
\begin{bmatrix} 2 & -4 & 5 & 3 \\ 3 & -6 & 4 & 2 \\ 4 & -8 & 17 & 11 \end{bmatrix} \tag{3.67}
$$

by bringing it to the upper triangular form.
Subtract the second row from the third one. As a result we obtain

$$
\begin{bmatrix} 2 & -4 & 5 & 3 \\ 3 & -6 & 4 & 2 \\ 1 & -2 & 13 & 9 \end{bmatrix}; \tag{3.68}
$$

permute the first and the third rows:

$$
\begin{bmatrix} 1 & -2 & 13 & 9 \\ 3 & -6 & 4 & 2 \\ 2 & -4 & 5 & 3 \end{bmatrix}. \tag{3.69}
$$

Subtract from the second row the first row multiplied by 3, and from the third
row the doubled first row. We have

$$
\begin{bmatrix} 1 & -2 & 13 & 9 \\ 0 & 0 & -35 & -25 \\ 0 & 0 & -21 & -15 \end{bmatrix}. \tag{3.70}
$$

Divide the second row by $(-5)$, and the third one by $(-3)$:

$$
\begin{bmatrix} 1 & -2 & 13 & 9 \\ 0 & 0 & 7 & 5 \\ 0 & 0 & 7 & 5 \end{bmatrix}. \tag{3.71}
$$

Subtract the second row from the third one:

$$
\begin{bmatrix} 1 & -2 & 13 & 9 \\ 0 & 0 & 7 & 5 \\ 0 & 0 & 0 & 0 \end{bmatrix}. \tag{3.72}
$$

From this we see that the rank of this matrix is equal to two.
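As a basic non-zero minor, take, for example, the minor of the initial matrix
located in rows 1, 2 and columns 3, 4:

$$
\begin{vmatrix} 5 & 3 \\ 4 & 2 \end{vmatrix} = -2 \neq 0. \tag{3.73}
$$

Then, use the first two equations, whose coefficients are included into the basic
minor. Move to the right-hand side of the equations the summands that are not
included into the basic minor. We obtain

$$
\begin{cases}
5x_3 + 3x_4 = 4x_2 - 2x_1, \\
4x_3 + 2x_4 = 6x_2 - 3x_1.
\end{cases} \tag{3.74}
$$

Assign two different sets of values to the free unknowns $x_1$ and $x_2$. The first case:

$$
\begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} 1 \\ 0 \end{bmatrix}. \tag{3.75}
$$

Substituting these values into the system, we obtain

$$
\begin{cases}
5x_3 + 3x_4 = -2, \\
4x_3 + 2x_4 = -3.
\end{cases} \tag{3.76}
$$

The solution of this system is $x_3 = -\frac{5}{2}$ and $x_4 = \frac{7}{2}$.
The second case:

$$
\begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} 0 \\ 1 \end{bmatrix}. \tag{3.77}
$$

Similarly to the first case, we obtain

$$
\begin{cases}
5x_3 + 3x_4 = 4, \\
4x_3 + 2x_4 = 6,
\end{cases} \tag{3.78}
$$

whence $x_3 = 5$ and $x_4 = -7$.
For the fundamental system of solutions we finally have

$$
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix} \in
\left\{
\begin{bmatrix} 1 \\ 0 \\ -5/2 \\ 7/2 \end{bmatrix},
\begin{bmatrix} 0 \\ 1 \\ 5 \\ -7 \end{bmatrix}
\right\}. \tag{3.79}
$$

The general solution of this homogeneous system may be written as

$$
X_{\text{gen.}} = C_1 \begin{bmatrix} 2 \\ 0 \\ -5 \\ 7 \end{bmatrix}
+ C_2 \begin{bmatrix} 0 \\ 1 \\ 5 \\ -7 \end{bmatrix}, \tag{3.80}
$$

where $C_1$ and $C_2$ are arbitrary numbers.
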
3.5 General Solution of the Non-homogeneous System 
of Equations 
Consider a non-homogeneous system of equations:

$$
\begin{cases}
a_{11}x_1 + a_{12}x_2 + \ldots + a_{1n}x_n = b_1, \\
a_{21}x_1 + a_{22}x_2 + \ldots + a_{2n}x_n = b_2, \\
\ldots \\
a_{m1}x_1 + a_{m2}x_2 + \ldots + a_{mn}x_n = b_m.
\end{cases} \tag{3.81}
$$

Theorem 3.5 Let $X_{\text{gen.}}$ be the general solution of the homogeneous system obtained
when all the values $b_i$ are replaced with zeros, and let $X_{\text{spec.}}$ be a particular solution
of the non-homogeneous system. Then $X$, the general solution of the non-homogeneous
system, is equal to

$$
X = X_{\text{gen.}} + X_{\text{spec.}}. \tag{3.82}
$$

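Example 3.7 Solve the system:

$$
\begin{cases}
2x_1 + x_2 - x_3 - 3x_4 = 2, \\
4x_1 + x_3 - 7x_4 = 3, \\
2x_2 - 3x_3 + x_4 = 1, \\
2x_1 + 3x_2 - 4x_3 - 2x_4 = 3.
\end{cases} \tag{3.83}
$$

Solution Write the augmented system matrix:

$$
\left[\begin{array}{rrrr|r}
2 & 1 & -1 & -3 & 2 \\
4 & 0 & 1 & -7 & 3 \\
0 & 2 & -3 & 1 & 1 \\
2 & 3 & -4 & -2 & 3
\end{array}\right]. \tag{3.84}
$$

To find the rank of this matrix, subtract from the second row the
doubled first row, and from the fourth row the first row:

$$
\left[\begin{array}{rrrr|r}
2 & 1 & -1 & -3 & 2 \\
0 & -2 & 3 & -1 & -1 \\
0 & 2 & -3 & 1 & 1 \\
0 & 2 & -3 & 1 & 1
\end{array}\right]. \tag{3.85}
$$

One can see that the last three rows are proportional to each other, and it is enough
to keep one of them:

$$
\left[\begin{array}{rrrr|r}
2 & 1 & -1 & -3 & 2 \\
0 & -2 & 3 & -1 & -1
\end{array}\right]. \tag{3.86}
$$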
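The rank of the basic and the augmented matrices is equal to two; therefore,
the system is consistent. To find the general solution of the homogeneous system,
move $x_3$ and $x_4$ to the right-hand side of the equations. We obtain

$$
\begin{cases}
2x_1 + x_2 = x_3 + 3x_4, \\
-2x_2 = -3x_3 + x_4.
\end{cases} \tag{3.87}
$$

Select the following values for the independent variables:

$$
\begin{bmatrix} x_3 \\ x_4 \end{bmatrix} = \begin{bmatrix} 1 \\ 0 \end{bmatrix}; \tag{3.88}
$$

in such a case, we obtain the system

$$
\begin{cases}
2x_1 + x_2 = 1, \\
-2x_2 = -3,
\end{cases} \tag{3.89}
$$

whence $x_2 = \frac{3}{2}$, $x_1 = -\frac{1}{4}$.
Now, selecting the values

$$
\begin{bmatrix} x_3 \\ x_4 \end{bmatrix} = \begin{bmatrix} 0 \\ 1 \end{bmatrix}, \tag{3.90}
$$

we obtain the system

$$
\begin{cases}
2x_1 + x_2 = 3, \\
-2x_2 = 1,
\end{cases} \tag{3.91}
$$

whose solution is $x_2 = -\frac{1}{2}$ and $x_1 = \frac{7}{4}$.
Therefore, the general solution of the homogeneous system has the form:

$$
X_{\text{gen.}} = C_1 \begin{bmatrix} -\frac{1}{4} \\ \frac{3}{2} \\ 1 \\ 0 \end{bmatrix}
+ C_2 \begin{bmatrix} \frac{7}{4} \\ -\frac{1}{2} \\ 0 \\ 1 \end{bmatrix},
\quad \text{where } C_1, C_2 \in \mathbb{R}. \tag{3.92}
$$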
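If we introduce new constants $C_1' = C_1/4$, $C_2' = C_2/4$, then
$X_{\text{gen.}}$ can be written in a form free from fractions:

$$
X_{\text{gen.}} = C_1' \begin{bmatrix} -1 \\ 6 \\ 4 \\ 0 \end{bmatrix}
+ C_2' \begin{bmatrix} 7 \\ -2 \\ 0 \\ 4 \end{bmatrix},
\quad \text{where } C_1', C_2' \in \mathbb{R}. \tag{3.93}
$$

In order to find a particular solution, return to the augmented matrix (3.86).
The equations for computing $X_{\text{spec.}}$ have the form:

$$
\begin{cases}
2x_1 + x_2 = x_3 + 3x_4 + 2, \\
-2x_2 = -3x_3 + x_4 - 1.
\end{cases} \tag{3.94}
$$

Assuming that the values of the independent variables are equal to zero, we find
$x_1 = \frac{3}{4}$, $x_2 = \frac{1}{2}$, and

$$
X_{\text{spec.}} = \begin{bmatrix} \frac{3}{4} \\ \frac{1}{2} \\ 0 \\ 0 \end{bmatrix}. \tag{3.95}
$$

As a result, the general solution of the non-homogeneous system is equal to

$$
X = X_{\text{gen.}} + X_{\text{spec.}}
= C_1' \begin{bmatrix} -1 \\ 6 \\ 4 \\ 0 \end{bmatrix}
+ C_2' \begin{bmatrix} 7 \\ -2 \\ 0 \\ 4 \end{bmatrix}
+ \frac{1}{4} \begin{bmatrix} 3 \\ 2 \\ 0 \\ 0 \end{bmatrix},
\quad \text{where } C_1', C_2' \in \mathbb{R}. \tag{3.96}
$$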
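
The structure stated in Theorem 3.5 can be verified for this example by direct substitution: the two basis vectors must satisfy the homogeneous system, the particular solution must reproduce the constant terms, and any combination of the form (3.96) must solve the full system. The following sketch assumes only the data of Example 3.7; the helper names are illustrative.

```python
from fractions import Fraction

# Coefficients and constant terms of system (3.83)
A = [[2, 1, -1, -3],
     [4, 0, 1, -7],
     [0, 2, -3, 1],
     [2, 3, -4, -2]]
b = [2, 3, 1, 3]

X1 = [-1, 6, 4, 0]                               # first vector in (3.93)
X2 = [7, -2, 0, 4]                               # second vector in (3.93)
X_spec = [Fraction(3, 4), Fraction(1, 2), 0, 0]  # particular solution (3.95)


def mat_vec(M, x):
    """Product of a matrix and a column given as a flat list."""
    return [sum(m * v for m, v in zip(row, x)) for row in M]


def general_solution(c1, c2):
    """Formula (3.96) for given values of the two constants."""
    return [c1 * u + c2 * v + p for u, v, p in zip(X1, X2, X_spec)]


homogeneous_ok = (mat_vec(A, X1) == [0, 0, 0, 0]
                  and mat_vec(A, X2) == [0, 0, 0, 0])
particular_ok = mat_vec(A, X_spec) == b
general_ok = all(mat_vec(A, general_solution(c1, c2)) == b
                 for c1 in range(-2, 3) for c2 in range(-2, 3))
print(homogeneous_ok, particular_ok, general_ok)
```

Exact fractions are used so that the equalities are tested without rounding.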


Review Questions 


1. Define a solution of a system of linear equations.
2. What system of equations is called consistent? Inconsistent?
3. When is a system of equations determined? Undetermined?
4. Explain how the augmented matrix of a system of linear equations is constructed.
5. Describe the methods of solving systems of linear equations: the Gaussian
   method, the inverse matrix method, Cramer's method.
6. What is the complexity of the Gaussian method?
7. What solution does any homogeneous system of linear equations have?
8. Define the fundamental system of solutions.
9. How is the general solution of a non-homogeneous system computed?

Problems 


3.1. In order to expand a computer laboratory, its chief is planning to purchase
9 workstations and 7 notebooks. If 14 workstations and 9 notebooks are
ordered instead, the cost of the purchase will be 1.5 times higher. Find how many
times more expensive the workstation is than the notebook.

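3.2. Solve the system of linear equations using Cramer's rule:

$$
\text{(a)} \begin{cases} 3x - 5y = 13, \\ 2x + 7y = 81; \end{cases}
\qquad
\text{(b)} \begin{cases} 2x_1 - x_2 + 3x_3 = 9, \\ 3x_1 - 5x_2 - x_3 = -10, \\ 4x_1 - 7x_2 + x_3 = -7; \end{cases}
$$

$$
\text{(c)} \begin{cases} x + 2y + z = 4, \\ 3x - 5y + 3z = 1, \\ 2x + 7y - z = 8; \end{cases}
\qquad
\text{(d)} \begin{cases} 2x - 4y + 9z = 28, \\ 7x + 3y - 6z = -1, \\ 7x + 9y - 9z = 5; \end{cases}
$$

$$
\text{(e)} \begin{cases} 7x + 2y + 3z = 15, \\ 5x - 3y + 2z = 15, \\ 10x - 11y + 5z = 36; \end{cases}
\qquad
\text{(f)} \begin{cases} x + y + z = 36, \\ 2x - 3z = -17, \\ 6x - 5z = 7; \end{cases}
$$

$$
\text{(g)} \begin{cases} 3x_1 + 2x_2 + x_3 = 5, \\ x_1 + x_2 - x_3 = 0, \\ 4x_1 - x_2 + 5x_3 = 3. \end{cases}
$$
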
3.3. Solve the system of linear equations using Gaussian elimination:

$$
\text{(a)} \begin{cases} 6x_1 + 2x_2 + 3x_3 = 74, \\ 7x_1 + 4x_2 = 91, \\ x_1 + x_2 + x_3 = 18; \end{cases}
\qquad
\text{(b)} \begin{cases} 2x_1 + 5x_2 - 2x_3 = -6, \\ -3x_1 - 2x_2 + x_3 = 0, \\ 3x_2 + 2x_3 = -8; \end{cases}
$$

$$
\text{(c)} \begin{cases} 3x_1 - x_2 + 6x_3 = -4, \\ 3x_1 - 7x_2 = 2, \\ -4x_1 - 4x_2 - 3x_3 = -10; \end{cases}
\qquad
\text{(d)} \begin{cases} 5x_1 + 3x_2 - 3x_3 = 8, \\ -4x_1 - 3x_2 - 2x_3 = 1, \\ -2x_1 + 3x_2 + 6x_3 = -29. \end{cases}
$$

3.4. Solve the system of linear equations using Gaussian elimination: 
2x1 + 7x2 + 4x3 = 32, x, —2x3—x4 = —6, 
2x, + 8x2 — x3 + 7x4 = 63, 5x1 — x2 + 6x3 + x4 = 23, 
(a) (b) 
6x1 + 6x2 + 8x3 — 8x4 = 2, 5x1 — 8x2 — 9x3 + 4x4 = 62, 
6x2 = 4x3 a re 5x4 = 58; 6x1 9x2 5x3 x4 = 73; 
4x, —9x3-—x4 = 37, 8x1 + x2 — 4x3 — 8x4 = 7, 
7x, — x2 —5x3-—5x4= 36, 7x2 —6x3+7x4 = 56, 
(c) (d) 
8x1 — 5x2 + 4x4 = —38, 8x, + 3x2 + 2x3 — 2x4 = —63, 
x1 — 4x2 + 9x3 — 4x4 = —25; 8x, — 3x2 — x3 — 4x4 = 6. 


*3.5. Solve the system of linear equations relative to five unknowns: 


5x1 — 8x2 — 5x3 + 8x4 + 8x5 = —S, 

2x2 + 2x3 4+x4-—X5 = 8, 

(a)  —2x1 + 4x2 + 3x3 — 8x4 + 4x5 = —39, 
5x1 + 6x2 + 2x3 —2x4-4x5= 32, 


xX, — 2x2 —x3+2x4-—2x5 = 23; 


6x, —X2 +6x3+3x4-—7Tx5= 6, 

—4x,; — 4x2 + 3x3 — x4 — 8x5 = —30, 

(b) —x, x2 +5x4— x5 = —22, 
4x, +x2 —3x34+3x4-—5x5= —-3, 


8x2 + x4 — x5 = —Ol. 
*3.6. Solve the system of linear equations relative to six unknowns:

—x, + 2x3+5x4-2x5-4x65= 4, 
—x, — x2 +x3—3x4+ 4x5 — 4x6 = —46, 
—x1 + 5x2 + 4x3 — 2x5 — x6 = —19, 
A xy — 2x2 + 4x3 -—2x5-—x6 = —9, 
—2x, — 5x2 + 3x3 — 2x4 — 3x5 = —20, 
3x, t+x24+3x34+4%5= 4; 
—2x, — 4x2 —4x5+5x6= 2, 
—x, + 5x2 + 3x3 + 5x5 = —12, 
(b) —2x, — 5x2 + 5x3 — 3x4 — 5x5 —3x6= —7, 
—4x1) — 5x2 — 3x3 + 5x4 —2x5+3x6= 10, 
xy + 3x2 — 5x3 4+ 4x4 + 3x5 + 2x6 = 8, 
—4x, +x — 4x3 4+ 3x4 +45 -— x6 = 3. 


3.7. Find the fourth-degree polynomial $p(x)$ with real coefficients for which the
following properties are valid: $p(5) = 1$ and $p(1) = p(2) = p(3) = p(4) = 0$.

3.8. At what values of the parameter $\lambda$ is the system of equations

$$
\begin{cases}
x_1 + x_2 = 1, \\
\lambda x_1 + x_2 = 2, \\
x_1 + \lambda x_2 = 4
\end{cases}
$$

consistent?
3.9. At what values of the parameter $\lambda$ does the system of linear equations


Xj +Ax2= 0, 
—xj,+x2-213= 5, 
—x2 +43 = —4, 


have a unique solution? For these values of $\lambda$, find the solution of the
system using Cramer's rule.
*3.10. Solve the matrix equation $A \cdot X \cdot B = C$, where


—323 ia —130 
A=|-214|,B= a ,C=)}] 250]. (3.97) 
6 12 —33 0 


Note the degeneracy of one of the multipliers, namely the matrix B. 
*3.11. Solve the matrix equations: 


62 -44 
(1) X= 
31 —22 
15 —5 1 -l1 
(2) X = ; 
—3 1 —2 1 
6 -1 1 6 19 —22 
3) X-|-10 -5 6|=]10 5 -6|; 
0 —2023 2621992 
-12 5 —7 8 2 
4) } 5 3 -1}-X=]}2 71 
7 -1-11 1 -11 


3.12. A program code is given that processes the one-dimensional array a,
consisting of five elements:

    for i in range(len(a)):
        temp = a[0]

        for j in range(len(a) - 1):
            a[j] = -2 * a[j + 1]
        a[len(a) - 1] += temp

After executing this program code segment, the array a consists of
the following elements: [-32, 32, 32, 32, 16]. Find what values
the elements of the array a[i], where i = 1,...,5, took before executing this
segment.

3.13. A program code is given that processes the one-dimensional array a,
consisting of seven elements:

    for i in range(len(a)):
        temp = a[0]

        for j in range(len(a) - 1):
            a[j] = 3 * a[j + 1] - 1

        a[len(a) - 2] = -a[len(a) - 2] - temp

After executing this program code segment, the array a consists of the
following elements: [365, 608, -769, -499, -409, 107, 5].
Find what values the elements of the array a[i], i = 1,...,7, took before
executing this segment.

3.14. There exists a modification of Gaussian elimination referred to as
Gauss–Jordan⁵ elimination. In the Gauss–Jordan method, the coefficient matrix
is brought not to a triangular form but to a diagonal one. Implement
this method in Python and compare its asymptotic complexity with the
complexity of the standard Gaussian elimination.

3.15. Find the general solution and the fundamental system of solutions for the 
systems of equations: 


x1 —440 + x3 = 0, 2x, — X2 +3x3+ x4 = 0, 
(1) xp tx2-23 =0, (2) 2x, — 5x2 — x3 = 0, 
3x1 — 2x2 — x3 = 0; 4x, — 7x2 +x3 + 3x4 = 0; 
xy + 2x2 + 4x3 — 3x4 = 0, 3x, + 5x2 + 2x3 = 0, 
3x, + 5x2 + 6x3 — 4x4 = 0, 4x; + 7x2 + 5x3 = 0, 
(3) (4) 
4x, + 5x2 — 2x3 + 3x4 = 0, xy + x2 — 4x3 = 0, 
3x1 + 8x2 + 24x3 — 19x4 = 0; 2x1 + 9x2 + 6x3 = 0; 


2x1 + 4x2 + 6x3 + x4 = 0, 
x+2y+3z=0, 

x1 + 2x2 + 3x3 4+ x4 = 0, 
(5) (6) } 2x + 3y+4z=0, 

3x; + 6x2 + 9x3 -— x4 = 0, 
x+y+z=0; 


Xx) + 2x2 + 3x3 + 5x4 = 0; 


Xx, — 2x2 + 3x3 — 4x4 = 0, 
7) 2x1 — 4x2 + 5x3 4+ 7x4 = 0, 
6x1, — 12x2 + 17x3 — 9x4 = 0, 


7x, — 14x20 + 19x3 + 17x4 = 0. 


5 Wilhelm Jordan (1842-1899), German geodesist and mathematician. 
Answers and Solutions


3.1 Solution.

Denote the price of one workstation by $x$ and of one notebook by $y$. Assume
that the cost of 9 workstations and 7 notebooks is $a$. Then we obtain a system of
linear equations in the unknowns $x$ and $y$:

$$
\begin{cases}
9x + 7y = a, \\
14x + 9y = \frac{3}{2}a.
\end{cases}
$$

Its solution is $x = \frac{3}{34}a$, $y = \frac{1}{34}a$. Therefore, the workstation is three times more
expensive than the notebook.
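3.2 Solution.


(a) The augmented matrix of the system has the form

$$
\left[\begin{array}{rr|r} 3 & -5 & 13 \\ 2 & 7 & 81 \end{array}\right].
$$

Compute the required determinants:

$$
\Delta = \begin{vmatrix} 3 & -5 \\ 2 & 7 \end{vmatrix} = 21 + 10 = 31, \quad
\Delta_x = \begin{vmatrix} 13 & -5 \\ 81 & 7 \end{vmatrix} = 91 + 405 = 496.
$$

Therefore, $x = \frac{\Delta_x}{\Delta} = \frac{496}{31} = 16$.

$$
\Delta_y = \begin{vmatrix} 3 & 13 \\ 2 & 81 \end{vmatrix} = 243 - 26 = 217, \quad
y = \frac{\Delta_y}{\Delta} = \frac{217}{31} = 7.
$$
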
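(b) The augmented matrix of the system has the form

$$
\left[\begin{array}{rrr|r} 2 & -1 & 3 & 9 \\ 3 & -5 & -1 & -10 \\ 4 & -7 & 1 & -7 \end{array}\right].
$$

$$
\Delta = \begin{vmatrix} 2 & -1 & 3 \\ 3 & -5 & -1 \\ 4 & -7 & 1 \end{vmatrix}
\overset{(1)+3(2)}{=} \begin{vmatrix} 11 & -16 & 0 \\ 3 & -5 & -1 \\ 4 & -7 & 1 \end{vmatrix}
= 11 \cdot (-12) + 16 \cdot 7 = -20,
$$

$$
\Delta_1 = \begin{vmatrix} 9 & -1 & 3 \\ -10 & -5 & -1 \\ -7 & -7 & 1 \end{vmatrix}
\overset{(1)+3(2)}{=} \begin{vmatrix} -21 & -16 & 0 \\ -10 & -5 & -1 \\ -7 & -7 & 1 \end{vmatrix}
\overset{(3)+(2)}{=} \begin{vmatrix} -21 & -16 & 0 \\ -10 & -5 & -1 \\ -17 & -12 & 0 \end{vmatrix}
= 252 - 272 = -20,
$$

$$
x_1 = \frac{\Delta_1}{\Delta} = \frac{-20}{-20} = 1,
$$

$$
\Delta_2 = \begin{vmatrix} 2 & 9 & 3 \\ 3 & -10 & -1 \\ 4 & -7 & 1 \end{vmatrix}
\overset{(1)+3(2)}{=} \begin{vmatrix} 11 & -21 & 0 \\ 3 & -10 & -1 \\ 4 & -7 & 1 \end{vmatrix}
\overset{(3)+(2)}{=} \begin{vmatrix} 11 & -21 & 0 \\ 3 & -10 & -1 \\ 7 & -17 & 0 \end{vmatrix}
= -(-1)(-187 + 147) = -40,
$$

$$
x_2 = \frac{\Delta_2}{\Delta} = \frac{-40}{-20} = 2,
$$

$$
\Delta_3 = \begin{vmatrix} 2 & -1 & 9 \\ 3 & -5 & -10 \\ 4 & -7 & -7 \end{vmatrix}
= \begin{vmatrix} 0 & -1 & 9 \\ -7 & -5 & -10 \\ -10 & -7 & -7 \end{vmatrix}
= (49 - 100) + 9(49 - 50) = -60,
$$

$$
x_3 = \frac{\Delta_3}{\Delta} = \frac{-60}{-20} = 3.
$$
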
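(c) The augmented matrix of the system has the form

$$
\left[\begin{array}{rrr|r} 1 & 2 & 1 & 4 \\ 3 & -5 & 3 & 1 \\ 2 & 7 & -1 & 8 \end{array}\right].
$$

$$
\Delta = \begin{vmatrix} 1 & 2 & 1 \\ 3 & -5 & 3 \\ 2 & 7 & -1 \end{vmatrix}
= \begin{vmatrix} 0 & 2 & 1 \\ 0 & -5 & 3 \\ 3 & 7 & -1 \end{vmatrix}
= 3 \cdot (6 + 5) = 33,
$$

$$
\Delta_x = \begin{vmatrix} 4 & 2 & 1 \\ 1 & -5 & 3 \\ 8 & 7 & -1 \end{vmatrix}
\overset{(3)-2(1)}{=} \begin{vmatrix} 4 & 2 & 1 \\ 1 & -5 & 3 \\ 0 & 3 & -3 \end{vmatrix}
= 4 \cdot (15 - 9) - (-6 - 3) = 33,
\quad x = \frac{\Delta_x}{\Delta} = \frac{33}{33} = 1,
$$

$$
\Delta_y = \begin{vmatrix} 1 & 4 & 1 \\ 3 & 1 & 3 \\ 2 & 8 & -1 \end{vmatrix}
= \begin{vmatrix} 1 & 0 & 1 \\ 3 & -11 & 3 \\ 2 & 0 & -1 \end{vmatrix}
= -11 \cdot (-1 - 2) = 33,
\quad y = \frac{\Delta_y}{\Delta} = \frac{33}{33} = 1,
$$

$$
\Delta_z = \begin{vmatrix} 1 & 2 & 4 \\ 3 & -5 & 1 \\ 2 & 7 & 8 \end{vmatrix}
\overset{(3)-2(1)}{=} \begin{vmatrix} 1 & 2 & 4 \\ 3 & -5 & 1 \\ 0 & 3 & 0 \end{vmatrix}
= -3 \cdot (1 - 12) = 33,
\quad z = \frac{\Delta_z}{\Delta} = \frac{33}{33} = 1.
$$
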
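(d) The augmented matrix of the system has the form

$$
\left[\begin{array}{rrr|r} 2 & -4 & 9 & 28 \\ 7 & 3 & -6 & -1 \\ 7 & 9 & -9 & 5 \end{array}\right].
$$

$$
\Delta = \begin{vmatrix} 2 & -4 & 9 \\ 7 & 3 & -6 \\ 7 & 9 & -9 \end{vmatrix}
\overset{(3)-(2)}{=} \begin{vmatrix} 2 & -4 & 9 \\ 7 & 3 & -6 \\ 0 & 6 & -3 \end{vmatrix}
= 2(-9 + 36) - 7(12 - 54) = 348,
$$

$$
\Delta_x = \begin{vmatrix} 28 & -4 & 9 \\ -1 & 3 & -6 \\ 5 & 9 & -9 \end{vmatrix}
\overset{(1)+(3)}{=} \begin{vmatrix} 33 & 5 & 0 \\ -1 & 3 & -6 \\ 5 & 9 & -9 \end{vmatrix}
= 6(297 - 25) - 9(99 + 5) = 696,
\quad x = \frac{696}{348} = 2,
$$

$$
\Delta_y = \begin{vmatrix} 2 & 28 & 9 \\ 7 & -1 & -6 \\ 7 & 5 & -9 \end{vmatrix}
\overset{(3)-(2)}{=} \begin{vmatrix} 2 & 28 & 9 \\ 7 & -1 & -6 \\ 0 & 6 & -3 \end{vmatrix}
= 2(3 + 36) - 7(-84 - 54) = 1044,
\quad y = \frac{1044}{348} = 3,
$$

$$
\Delta_z = \begin{vmatrix} 2 & -4 & 28 \\ 7 & 3 & -1 \\ 7 & 9 & 5 \end{vmatrix}
\overset{(3)-(2)}{=} \begin{vmatrix} 2 & -4 & 28 \\ 7 & 3 & -1 \\ 0 & 6 & 6 \end{vmatrix}
= 2(18 + 6) - 7(-24 - 168) = 1392,
\quad z = \frac{1392}{348} = 4.
$$
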
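(e) The augmented matrix of the system has the form

$$
\left[\begin{array}{rrr|r} 7 & 2 & 3 & 15 \\ 5 & -3 & 2 & 15 \\ 10 & -11 & 5 & 36 \end{array}\right].
$$

$$
\Delta = \begin{vmatrix} 7 & 2 & 3 \\ 5 & -3 & 2 \\ 10 & -11 & 5 \end{vmatrix}
\overset{(3)-2(2)}{=} \begin{vmatrix} 7 & 2 & 3 \\ 5 & -3 & 2 \\ 0 & -5 & 1 \end{vmatrix}
= 5(14 - 15) + (-21 - 10) = -36,
$$

$$
\Delta_x = \begin{vmatrix} 15 & 2 & 3 \\ 15 & -3 & 2 \\ 36 & -11 & 5 \end{vmatrix}
\overset{(2)-(1)}{=} \begin{vmatrix} 15 & 2 & 3 \\ 0 & -5 & -1 \\ 36 & -11 & 5 \end{vmatrix}
= -72,
\quad x = \frac{-72}{-36} = 2,
$$

$$
\Delta_y = \begin{vmatrix} 7 & 15 & 3 \\ 5 & 15 & 2 \\ 10 & 36 & 5 \end{vmatrix}
\overset{(2)-(1)}{=} \begin{vmatrix} 7 & 15 & 3 \\ -2 & 0 & -1 \\ 10 & 36 & 5 \end{vmatrix}
= 2(75 - 108) + (252 - 150) = 36,
\quad y = \frac{36}{-36} = -1,
$$

$$
\Delta_z = \begin{vmatrix} 7 & 2 & 15 \\ 5 & -3 & 15 \\ 10 & -11 & 36 \end{vmatrix} = -36,
\quad z = \frac{-36}{-36} = 1.
$$
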
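(f) The augmented matrix of the system has the form

$$
\left[\begin{array}{rrr|r} 1 & 1 & 1 & 36 \\ 2 & 0 & -3 & -17 \\ 6 & 0 & -5 & 7 \end{array}\right].
$$

$$
\Delta = \begin{vmatrix} 1 & 1 & 1 \\ 2 & 0 & -3 \\ 6 & 0 & -5 \end{vmatrix} = -(-10 + 18) = -8,
$$

$$
\Delta_x = \begin{vmatrix} 36 & 1 & 1 \\ -17 & 0 & -3 \\ 7 & 0 & -5 \end{vmatrix} = -(85 + 21) = -106,
\quad x = \frac{-106}{-8} = \frac{53}{4},
$$

$$
\Delta_y = \begin{vmatrix} 1 & 36 & 1 \\ 2 & -17 & -3 \\ 6 & 7 & -5 \end{vmatrix}
\overset{(3)-3(2),\ (2)-2(1)}{=} \begin{vmatrix} 1 & 36 & 1 \\ 0 & -89 & -5 \\ 0 & 58 & 4 \end{vmatrix}
= -356 + 290 = -66,
\quad y = \frac{-66}{-8} = \frac{33}{4},
$$

$$
\Delta_z = \begin{vmatrix} 1 & 1 & 36 \\ 2 & 0 & -17 \\ 6 & 0 & 7 \end{vmatrix} = -(14 + 17 \cdot 6) = -116,
\quad z = \frac{-116}{-8} = \frac{29}{2}.
$$
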
(g) The augmented matrix of the system has the form 
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32 1145 
Lb pb: 
4-15 |3 
4°94 1 13 
fat S ene 5-14 
52 1 523 
Ai=|o 1 =1/ =8F Pl lo 1 0] =20-9=11, 
515 3-14 
US ae , 
3.5.4 45 1 
Ao = |10 —1) ="4 81 19 9 -1] = 12 — 45 = —33, 
435 935 
233 
= 3: 
aces 
32 5 1129. 5 
A3=/1 1 of SUP lo 1 oO] = 3-25 = —22, 
Aa een De) 
wy, 
x3 — =2 
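The determinant computations of this problem can be cross-checked numerically. The following is a minimal sketch rather than the book's own program: it applies Cramer's rule with exact rational arithmetic to the system of part (f), read as x + y + z = 36, 2x - 3z = -17, 6x - 5z = 7. The helper names `det3` and `cramer3` are illustrative assumptions.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def cramer3(A, rhs):
    """Solve a 3x3 system by Cramer's rule; requires a non-zero determinant."""
    delta = det3(A)
    if delta == 0:
        raise ValueError("Cramer's rule needs a non-zero determinant")
    solution = []
    for col in range(3):
        # Replace column `col` of A by the right-hand side.
        replaced = [[rhs[r] if k == col else A[r][k] for k in range(3)]
                    for r in range(3)]
        solution.append(Fraction(det3(replaced), delta))
    return solution


A = [[1, 1, 1], [2, 0, -3], [6, 0, -5]]
rhs = [36, -17, 7]
print(cramer3(A, rhs))
```

Using `Fraction` keeps the quotients exact, so the result can be compared directly with the fractions obtained by hand.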
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3.3 Solution. 
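(a) Execute the following elementary transformations of the augmented system matrix:
$$\left[\begin{array}{ccc|c} 6 & 2 & 3 & 74 \\ 7 & 4 & 0 & 91 \\ 1 & 1 & 1 & 18 \end{array}\right] \xrightarrow[(2)-7(3)]{(1)-6(3)} \left[\begin{array}{ccc|c} 0 & -4 & -3 & -34 \\ 0 & -3 & -7 & -35 \\ 1 & 1 & 1 & 18 \end{array}\right] \xrightarrow{-3\cdot(1)} \left[\begin{array}{ccc|c} 0 & 12 & 9 & 102 \\ 0 & -3 & -7 & -35 \\ 1 & 1 & 1 & 18 \end{array}\right] \xrightarrow{(1)+4(2)} \left[\begin{array}{ccc|c} 0 & 0 & -19 & -38 \\ 0 & -3 & -7 & -35 \\ 1 & 1 & 1 & 18 \end{array}\right].$$
Hence it follows that $x_3 = \frac{38}{19} = 2$,
$-3x_2 = -35 + 14 = -21$, $x_2 = 7$,
$x_1 = 18 - 2 - 7 = 9$.
The final answer is $[x_1, x_2, x_3]^T = [9, 7, 2]^T$.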
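(b) Execute the following elementary transformations of the augmented system matrix:
$$\left[\begin{array}{ccc|c} 2 & 5 & -2 & -6 \\ -3 & -2 & 1 & 0 \\ 0 & 3 & 2 & -8 \end{array}\right] \xrightarrow{2\cdot(2)} \left[\begin{array}{ccc|c} 2 & 5 & -2 & -6 \\ -6 & -4 & 2 & 0 \\ 0 & 3 & 2 & -8 \end{array}\right] \xrightarrow{(2)+3(1)} \left[\begin{array}{ccc|c} 2 & 5 & -2 & -6 \\ 0 & 11 & -4 & -18 \\ 0 & 3 & 2 & -8 \end{array}\right]$$
$$\xrightarrow{11\cdot(3)} \left[\begin{array}{ccc|c} 2 & 5 & -2 & -6 \\ 0 & 11 & -4 & -18 \\ 0 & 33 & 22 & -88 \end{array}\right] \xrightarrow{(3)-3(2)} \left[\begin{array}{ccc|c} 2 & 5 & -2 & -6 \\ 0 & 11 & -4 & -18 \\ 0 & 0 & 34 & -34 \end{array}\right].$$
Hence it follows that $x_3 = \frac{-34}{34} = -1$,
$11x_2 = -18 - 4 = -22$, $x_2 = -2$,
$2x_1 = -6 - 2 + 10 = 2$, $x_1 = 1$.
We obtain the answer: $[x_1, x_2, x_3]^T = [1, -2, -1]^T$.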
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(c) Execute the following elementary transformations of the augmented system 


matrix: 
3-1 6 | -4 3-16 | -4 
2270 2 (oP | 0-626) 6 | 9" 
—4 -4 -3| -10 —4—4 -3| -10 
12 —4 24 | -16 0 —16 15 | —46 
>] 0-6-6) 6 |=?) 0 -6 -6| 6 |=? 
—4 —4 -3 | -10 —4 -4 -3|-10 
0 -16 15 | —46 QSICUS esol sce 
>] 0 -48-48) 48 | ~P°PM 1 0 0 —93| 186 | OW 
—4 -4 -3 | -10 —4-4 -3 | -10 


—4 -4 —3 |-10 
>] 0 -16 15 | —46].- 

0 0O —93 | 186 

Hence it follows that $x_3 = \frac{186}{-93} = -2$,
$-16x_2 = -46 + 30 = -16$, $x_2 = 1$,
$-4x_1 = -10 - 6 + 4 = -12$, $x_1 = 3$.
We obtain the answer: $[x_1, x_2, x_3]^T = [3, 1, -2]^T$.

(d) Execute the following elementary transformations of the augmented system 


matrix: 
5 3 -3| 8 5 3 -3| 8 
-4-3-2| 1 | 7%?) 0 -9-14| 59 | 37 
-2 3 6 |-29 -23 6 |-29 


10 6 m9 0 21 24 |-129 

1 
0 -9-14| 59 |} > MPO | 9 9-14] 59 | >3 
~29 -23 6 | -29 


=z. 3 —29 —2 3 6 | —29 


07 8 |-43 (1)<9(3) —23 6 | —29 


0 0-26| 26 | ~-9°@!] 07 8 | -43]. 


—43 0 7 8 |-4 
>| 0 -9-14| 59 | >7%=]| 0 —63 -98| 413 | OHM 
es 629 0 0-26| 26 
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Hence it follows that x3 = —1, 
$7x_2 = -43 + 8 = -35$, $x_2 = -5$,
$-2x_1 = -29 + 6 + 15 = -8$, $x_1 = 4$.
We obtain the answer: $[x_1, x_2, x_3]^T = [4, -5, -1]^T$.
3.4 Solution. 


(a) Execute the following elementary transformations 
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of the augmented system 


52-2) 


191 


-1 
58 


—» (4)—25(2) 


matrix: 
—27 4 0 | 32 0153 7 
2 8-1 7 | 63 _ +0) 2 8 -1 7 
—-33 4 -4| 1 —33 4 -4 
0 6-4 5 |58 0 6 -—4 5 
2 8 -1 7 | 63 28 -17 
_ 015 3 7 |95 _214)43(1) 015 3 7 
0 6 -—4 5 | 58 06 -—45 
—-3 3 4 -4| 1 030 5 13 
28 -1 7 | 63 28-1 7 | 
(2)—(4) 
=n 030 6 14| 190 _, 56) 00 1 1 | 
06 —4 5 | 58 06-4 5 | 
030 5 13|191 00 25 —12 | —99 
28-1 7 | 63 
06-4 5 | 58 
er: ? 
oo 1 1 |-!I 
00 0 —37 | -—74 
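$$\begin{aligned} x_4 &= \frac{-74}{-37} = 2, \\ x_3 &= -1 - 2 = -3, \\ 6x_2 &= 58 - 12 - 10 = 36, \quad x_2 = 6, \\ 2x_1 &= 63 - 48 - 3 - 14 = -2, \quad x_1 = -1. \end{aligned}$$
Write the answer: $[x_1, x_2, x_3, x_4]^T = [-1, 6, -3, 2]^T$.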
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(b) Execute the following elementary transformations of the augmented system 


matrix: 
1 0 -2-1|-6 10 21/6 
“5-16 1123) acy f-5-1 6 1123] aie 
5 -8-9 4 | 62 5 —8 —94 | 62 
6 -—9-5 1 | 73 6 —9 —-5 1 | 73 
1021/6 10 2 1 | 6 
ete: 2)+5(1 ie 
5-16 1/23] Brey ;O-1 16 6 153] wo) 
0 —9 —35| 85 0-9 —3 5 |85 
6 -9-51|73 D017 = 5 (37 
PO> oe AG 10 2116 
= O-1 16 6 | 53 4) 0-1 16 6| 53 —s 3)-9(2) 
0-9 -3 5 | 85 0 —9 —35 | 85 
0:0: =14 10) +48 00 7 5|24 
Lo 2 ape 150: OS 75(26 
xy) | Or 20 T) 598 |) ah aay OT BO 26) 935 aay ea) 
0 0 —147 —49 | —392 00 0 56| 112 
00 #7 5 | 24 00 7 5 | 24 
102 1{1{ 6 
0-116 6 | 53 
> 
00 7 5 | 24 
00 0 56| 112 
$$\begin{aligned} x_4 &= \frac{112}{56} = 2, \\ 7x_3 &= 24 - 10 = 14, \quad x_3 = 2, \\ -x_2 &= 53 - 12 - 32 = 9, \quad x_2 = -9, \\ x_1 &= 6 - 2 - 4 = 0. \end{aligned}$$
Write the answer: $[x_1, x_2, x_3, x_4]^T = [0, -9, 2, 2]^T$.
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(c) Execute the following elementary transformations of the augmented system 


matrix: 


(1)-4(4) 
-1(4) 
4 


(2) 

40 -9-1| 37 G8) 1-4 9 -—4|-—25 

ese a (<e(4) = 

7-1-5-5| 36 | Bow | 016 -45 151 137) ry 
8 =) 0. 4. [38 0 27 —68 23 | 211 

baa ea, 25, 0 27 —72 36 | 162 
1=<4..9 4] —95 14 9 —4 [—25 

0 16 —45 15 | 137} (so) | 0 16 -45 15 1 137] 3.) 
0 27 —68 23 | 211 00 4 —13| 49 

03 -8 4 | 18 03 -8 4 | 18 
1-4 9 —4 | -25 P49 4:05 
048 -135 45 | 411} yoy | 9 9 -7-191 123 | gia) 
Oi! © 2A 3 |) AO 00 4 —13| 49 

Ti at a) 03-8 4 | 18 
ie 9 —4 | —25 ie 9 —4 | —25 

_28 — (2)+7@3) - 

0 0 -28-76| 492} Or7G) )0 3-8 4 | 18 

00 4 —13| 49 00 4 -13 | 49 

03 -8 4 | 18 00 0 —167| 835 

$$\begin{aligned} x_4 &= \frac{835}{-167} = -5, \\ 4x_3 &= 49 - 65 = -16, \quad x_3 = -4, \\ 3x_2 &= 18 - 32 + 20 = 6, \quad x_2 = 2, \\ x_1 &= -25 + 8 + 36 - 20 = -1. \end{aligned}$$
Write the answer: $[x_1, x_2, x_3, x_4]^T = [-1, 2, -4, -5]^T$.
(d) Execute the following elementary transformations of the augmented system 


matrix: 
—8 1 -4-8]| 7 —8 1 -4-8| 7 i 
ig a (3)-() pany pee = 
0 -—7-6 7 | 56 ee 0 -—7-6 7 | 56 2° 
—-8 3 2 -—2| -63 0 2 6 6 |-70 
8 —3 -—1 -—4| —6 0-4 3 4 |-13 
—8 1 -4-8| 7 —81-4-8| 7 
in, a QN+73 = 
0 -—7-6 7 | 56 L4G) 0 015 28 | —189 _,4)-0) 
0 1 3 3 |-—35 013 3 | —35 


0-43 4 |-13 0 015 16 | —153 
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Pad 2a. A | 


0 015 28 | —189 


=> 
013 3 | —35 
000 —12| 36 
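$$\begin{aligned} x_4 &= \frac{36}{-12} = -3, \\ 15x_3 &= 84 - 189 = -105, \quad x_3 = -7, \\ x_2 &= -35 + 9 + 21 = -5, \\ -8x_1 &= 7 - 24 - 28 + 5 = -40, \quad x_1 = 5. \end{aligned}$$
Write the answer: $[x_1, x_2, x_3, x_4]^T = [5, -5, -7, -3]^T$.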
3.5 Answer: 
(a) [x1, X2,X3,%4, x5]? = [6, —6, 7, 0, —6]'; 
(b) [x1, X2,%X3,%4, x5] = [6, = -1, 1, Ae 
3.6 Answer:
(a) $[x_1, x_2, x_3, x_4, x_5, x_6]^T = [5, 0, 5, 5, 5, 4]^T$;
(b) $[x_1, x_2, x_3, x_4, x_5, x_6]^T = [1, -2, -2, 0, 1, 0]^T$.

3.7 Solution.
The fourth-degree polynomial with real coefficients can be presented in the form:

$$p(x) = a_4x^4 + a_3x^3 + a_2x^2 + a_1x + a_0,$$


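where $a_0, a_1, \ldots, a_4 \in \mathbb{R}$ are the unknown coefficients. We obtain a system of linear equations relative to the coefficients $a_i$:

$$\begin{cases} 5^4a_4 + 5^3a_3 + 5^2a_2 + 5a_1 + a_0 = 1, \\ 1^4a_4 + 1^3a_3 + 1^2a_2 + 1a_1 + a_0 = 0, \\ 2^4a_4 + 2^3a_3 + 2^2a_2 + 2a_1 + a_0 = 0, \\ 3^4a_4 + 3^3a_3 + 3^2a_2 + 3a_1 + a_0 = 0, \\ 4^4a_4 + 4^3a_3 + 4^2a_2 + 4a_1 + a_0 = 0. \end{cases}$$

Write the matrix of this system and bring it to the echelon form:

$$\left[\begin{array}{ccccc|c} 625 & 125 & 25 & 5 & 1 & 1 \\ 1 & 1 & 1 & 1 & 1 & 0 \\ 16 & 8 & 4 & 2 & 1 & 0 \\ 81 & 27 & 9 & 3 & 1 & 0 \\ 256 & 64 & 16 & 4 & 1 & 0 \end{array}\right] \rightarrow \left[\begin{array}{ccccc|c} 1 & 1 & 1 & 1 & 1 & 0 \\ 0 & 8 & 12 & 14 & 15 & 0 \\ 0 & 0 & 6 & 9 & 10 & 0 \\ 0 & 0 & 0 & 24 & 50 & 0 \\ 0 & 0 & 0 & 0 & 1 & 1 \end{array}\right].$$

From the echelon form we find the values of $a_i$:

$$\begin{cases} a_4 + a_3 + a_2 + a_1 + a_0 = 0, \\ 8a_3 + 12a_2 + 14a_1 + 15a_0 = 0, \\ 6a_2 + 9a_1 + 10a_0 = 0, \\ 24a_1 + 50a_0 = 0, \\ a_0 = 1; \end{cases} \qquad \Rightarrow \qquad a_0 = 1, \quad a_1 = -\frac{50}{24}, \quad a_2 = \frac{35}{24}, \quad a_3 = -\frac{10}{24}, \quad a_4 = \frac{1}{24}.$$

As a result, the sought-for fourth-degree polynomial is equal to

$$p(x) = \frac{1}{24}\left(x^4 - 10x^3 + 35x^2 - 50x + 24\right).$$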
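The result can be verified by direct substitution. The sketch below is only a check under the assumption, read off from the system above, that the interpolation conditions are p(1) = p(2) = p(3) = p(4) = 0 and p(5) = 1; the names `COEFFS` and `p` are illustrative.

```python
from fractions import Fraction

# Coefficients a4, a3, a2, a1, a0 of the polynomial found above.
COEFFS = [Fraction(c, 24) for c in (1, -10, 35, -50, 24)]


def p(x):
    """Evaluate the polynomial by Horner's scheme with exact arithmetic."""
    value = Fraction(0)
    for coefficient in COEFFS:
        value = value * x + coefficient
    return value


for x in range(1, 6):
    print(x, p(x))
```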


3.8 Solution. 

The system of linear equations is consistent if and only if the rank of the system 
matrix is equal to the rank of the augmented system matrix. 

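Find the rank of the basic matrix:

$$A = \begin{bmatrix} 1 & 1 \\ \lambda & 1 \\ 1 & \lambda \end{bmatrix} \xrightarrow[(3)-(1)]{(2)-\lambda(1)} \begin{bmatrix} 1 & 1 \\ 0 & 1-\lambda \\ 0 & \lambda-1 \end{bmatrix} \xrightarrow{(3)+(2)} \begin{bmatrix} 1 & 1 \\ 0 & 1-\lambda \\ 0 & 0 \end{bmatrix}.$$

Therefore,

$$\operatorname{rk} A = \begin{cases} 2, & \text{if } \lambda \neq 1, \\ 1, & \text{if } \lambda = 1. \end{cases}$$

Compute the rank of the augmented matrix:

$$(A|B) = \left[\begin{array}{cc|c} 1 & 1 & 1 \\ \lambda & 1 & 2 \\ 1 & \lambda & 4 \end{array}\right] \xrightarrow[(3)-(1)]{(2)-\lambda(1)} \left[\begin{array}{cc|c} 1 & 1 & 1 \\ 0 & 1-\lambda & 2-\lambda \\ 0 & \lambda-1 & 3 \end{array}\right] \xrightarrow{(3)+(2)} \left[\begin{array}{cc|c} 1 & 1 & 1 \\ 0 & 1-\lambda & 2-\lambda \\ 0 & 0 & 5-\lambda \end{array}\right].$$

Thus,

$$\operatorname{rk}(A|B) = \begin{cases} 2, & \text{if } \lambda = 5, \\ 3, & \text{if } \lambda \neq 5. \end{cases}$$

The condition $\operatorname{rk} A = \operatorname{rk}(A|B)$ is satisfied at $\lambda = 5$. Therefore the system is consistent at $\lambda = 5$.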


3.9 Answer: 

The system has the unique solution at A # 0. For such values of A the solution of 
the system has the form x; = —1, x2 = aes B= ala, 
3.10 Solution. 


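Since the matrix $A$ has the size $3 \times 3$, and the matrix $B$ has the size $2 \times 2$, the unknown matrix $X$ can be represented in the form:

$$X = \begin{bmatrix} a & b \\ c & d \\ e & f \end{bmatrix}, \quad \text{where } a, b, c, d, e, f \text{ are some real numbers.}$$

Having substituted this matrix into the equation $A \cdot X \cdot B = C$, we obtain a system of linear equations relative to the unknowns $a, b, \ldots, f$:

$$\begin{cases} 3a + 3b - 2c - 2d - 3e - 3f = -13, \\ 2a + 2b - c - d - 4e - 4f = -25, \\ -6a - 6b - c - d - 2e - 2f = -33. \end{cases}$$

Bring the augmented system matrix to the echelon form:

$$M = \left[\begin{array}{cccccc|c} 3 & 3 & -2 & -2 & -3 & -3 & -13 \\ 2 & 2 & -1 & -1 & -4 & -4 & -25 \\ -6 & -6 & -1 & -1 & -2 & -2 & -33 \end{array}\right] \rightarrow \left[\begin{array}{cccccc|c} 3 & 3 & -2 & -2 & -3 & -3 & -13 \\ 0 & 0 & 1 & 1 & -6 & -6 & -49 \\ 0 & 0 & 0 & 0 & 1 & 1 & 8 \end{array}\right].$$

As the basic minor we select, for example, the minor formed by the first, third and fifth columns, which is non-zero. Hence, $b, d, f$ are the independent variables by which the variables $a, c, e$ are expressed:

$$X = \begin{bmatrix} 3-b & b \\ -1-d & d \\ 8-f & f \end{bmatrix}, \quad \text{where } b, d, f \in \mathbb{R}.$$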
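3.11 Answer:
(1) $X = \begin{bmatrix} a & b \\ -2-3a & 2-3b \end{bmatrix}$, where $a, b \in \mathbb{R}$;
(2) $X \in \emptyset$, i.e. there are no solutions;
(3) $X = \dfrac{1}{5}\begin{bmatrix} 5a & 3(a-1) & -a-4 \\ 5d & 3d-5 & -d \\ 5g & 3(g+1) & 4-g \end{bmatrix}$, where $a, d, g \in \mathbb{R}$;
(4) $X \in \emptyset$.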


3.12 Solution. 

Denote the initial values of the elements of the array a[] by a[1],a[2],..., 
a[5]. After executing the program segment code, the array a[] will contain the 
following elements: 

16a[1]+16a[5], 

-8a[1]+16a[2] -8a[5], 

4a[1] -8a[2]+16a[3]+4a[5], 

-2a[1]+4a[2] -8a[3]+16a[4] -2a[5], 

a[1]-2a[2]+4a[3]-8a[4]+17a[5]. 
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We obtain system of linear equations with five unknowns. Its solution is a[1] = 
-4, a[2] = 1, a[3] = 3, a[4] = 3, a[5] = 2. Hence, the initial array has the form
[-4, 1, 3, 3, 2].


3.13 Answer: [—14, —5, —2,1,1,1, 5]. 
3.14 Solution. 


import math

# Absolute tolerance: math.isclose(x, 0) with default arguments is true
# only for x == 0 exactly, so a tolerance is passed explicitly.
EPS = 1e-12


def gauss_jordan_elimination(A, B):
    m = len(A)
    n = len(A[0])

    # The loops below assume a square system of n equations.
    if len(B) != m or m != n:
        raise ValueError

    # Augmented matrix C = (A | B)
    C = [[A[i][j] if j != n else B[i]
          for j in range(n + 1)] for i in range(m)]

    for r in range(n):
        max_row_pos = r

        # Pivoting strategy: take the row with the largest |C[i][r]|
        for i in range(r + 1, n):
            if abs(C[i][r]) > abs(C[max_row_pos][r]):
                max_row_pos = i

        C[r], C[max_row_pos] = C[max_row_pos], C[r]

        if math.isclose(C[r][r], 0, abs_tol=EPS):
            continue

        # Eliminate column r from all rows except the pivot row
        for i in range(n):
            factor = C[i][r] / C[r][r]

            for j in range(n + 1):
                if i != r and j != r:
                    C[i][j] -= factor * C[r][j]

        for i in range(n):
            if i != r:
                C[i][r] = 0.0

        # Normalize the pivot row
        for j in range(n + 1):
            if j != r:
                C[r][j] /= C[r][r]
        C[r][r] = 1.0

    answer = [0] * n

    for i in range(n):
        if not math.isclose(C[i][i], 0, abs_tol=EPS):
            answer[i] = C[i][n]
        elif not math.isclose(C[i][n], 0, abs_tol=EPS):
            return None  # the system is inconsistent

    return answer


Let us give an example of a call of the function gauss_jordan_elimination():

size = 100

A = [[0 for j in range(size)]
     for i in range(size)]

B = [0 for i in range(size)]

for i in range(size):
    for j in range(size):
        A[i][j] = 1 if j == i else 0

for i in range(size):
    B[i] = float(i)

print(A)
print(B)

print(gauss_jordan_elimination(A, B))


The asymptotic complexity of the Gauss-Jordan method coincides with the
complexity of the Gaussian method and is equal to O(n³), where n is the number of
equations of the initial system of equations [58].
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3.15 Solution. 


(1) Denote the system matrix by A and bring it to the upper triangular form. 


1-41 ay (toot 1-4 1 
A=/]1 1 -1]/ > @30]9 5 -2} 797219 5 _2 
39 -1 0 10 —4 00 0 


The rank of the matrix is equal to two, and the number of variables is equal to
three. This implies that the system will have 3 - 2 = 1 free variable.
Write the resulting equations:

$$\begin{cases} x_1 - 4x_2 + x_3 = 0, \\ 5x_2 - 2x_3 = 0. \end{cases}$$

As the independent variable, select $x_3$:

$$\begin{cases} x_1 = \frac{3}{5}x_3, \\ x_2 = \frac{2}{5}x_3. \end{cases}$$

As a result we obtain the fundamental system of solutions: $\{[3, 2, 5]^T\}$.
(2) Write the system matrix A and bring it to the upper triangular form. 


og Ae ms es aa | 
(2)-() 
A=|2-5-10] ~ ®-2@ | 9 —4 —4 -1] ] 3 71@ 
4-7 1 3 0-545 1 
es ee a PATA A 
slo4a 41) >4@4@]o 4 41 
0=5=5 1 0009 


It is clear that the rank of the matrix is rk A = 3, the number of variables is
equal to 4. Therefore, the system will have 4 - 3 = 1 free variable. Write the
resulting equations:

$$\begin{cases} 2x_1 - x_2 + 3x_3 + x_4 = 0, \\ 4x_2 + 4x_3 + x_4 = 0, \\ 9x_4 = 0. \end{cases}$$

As the independent variable, select $x_3$:

$$\begin{cases} x_1 = -2x_3, \\ x_2 = -x_3, \\ x_4 = 0. \end{cases}$$

The fundamental system of solutions is $\{[-2, -1, 1, 0]^T\}$.


(3) Write the system matrix A and bring it to the upper triangular form. 
12 4 -3 12 4 -3 
G3) 
ied 35 6 —-4 _, B40) 0-1 -6 5 
45-2 3 0-3 —-18 15 
3 8 24 —19 03 18 —15 


Note that the second, third and the fourth rows are proportional: 
12 4 -3 
0-1-6 5 


The rank of the matrix rk A = 2, the number of variables is 4. Therefore, the
system will have 4 - 2 = 2 free variables. Write the resulting equations:

$$\begin{cases} x_1 + 2x_2 + 4x_3 - 3x_4 = 0, \\ -x_2 - 6x_3 + 5x_4 = 0. \end{cases}$$

As the independent variables, select $x_3$ and $x_4$:

$$\begin{cases} x_1 = 8x_3 - 7x_4, \\ x_2 = -6x_3 + 5x_4. \end{cases}$$

The fundamental system of solutions is $\{[8, -6, 1, 0]^T, [-7, 5, 0, 1]^T\}$.
(4) Write the system matrix A and bring it to the upper triangular form. 


352 ‘REA 11-4 
arty 
gle? esmSen [eS Nay OS SP | | saa=@) 
ee 35 2 0214 


29 6 29 6 07 14 
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a ] 

03 21 03 21 

ae _, 38)-202) 
0214 00 0 
05 0 05 0 


The third row entirely consists of zero elements and can be eliminated from the 
system matrix. 


11-4 ri4 ieee 
0321} >? J05 0 | 3°? 105 0 
05 0 03 21 0.0 105 


The rank of the matrix rk A = 3, the number of variables is 3. Therefore, the
system has no free variables. Write the resulting equations:

$$\begin{cases} x_1 + x_2 - 4x_3 = 0, \\ 5x_2 = 0, \\ 105x_3 = 0. \end{cases}$$

Therefore

$$x_1 = 0, \quad x_2 = 0, \quad x_3 = 0.$$

As a result, the system has only the trivial solution: $[0, 0, 0]^T$.


(5) Write the system matrix and bring it to the upper triangular form. 
246 1 123 0 123 0 
G30) 
re 123 1 _, ()-@) 123 1 ebay} 000 1 
369-1 369-1 000-1 
123 5 123 5 000 5 


The second, third and fourth rows are proportional:

$$\begin{bmatrix} 1 & 2 & 3 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}.$$

The rank of the matrix rk A = 2, the number of variables is equal to four,
therefore, the system will have 4 - 2 = 2 free variables. Write the resulting
equations:

$$\begin{cases} x_1 + 2x_2 + 3x_3 = 0, \\ x_4 = 0. \end{cases}$$

Therefore

$$\begin{cases} x_1 = -2x_2 - 3x_3, \\ x_4 = 0. \end{cases}$$

The fundamental system of solutions is $\{[-2, 1, 0, 0]^T, [-3, 0, 1, 0]^T\}$.

(6) Write the matrix A and bring it to the upper triangular form.


125 12 3 
(2)-2(1) 

A=|234|] 7 @-@ |0-1-2 

es a 0-1-2 


The second and the third rows are proportional:

$$\begin{bmatrix} 1 & 2 & 3 \\ 0 & -1 & -2 \end{bmatrix}.$$

The rank of the matrix rk A = 2, the number of variables is equal to three.
Therefore, the system will have 3 - 2 = 1 free variable. Write the resulting
equations:

$$\begin{cases} x + 2y + 3z = 0, \\ -y - 2z = 0. \end{cases}$$

Therefore

$$\begin{cases} x = z, \\ y = -2z. \end{cases}$$

The fundamental system of solutions is $\{[1, -2, 1]^T\}$.


(7) Write the system matrix A and bring it to the upper triangular form. 


eee ee oS 4 
G)6t1) 

2-45 7]  @-S))0 0 -115 

6 -12 17-9 00 -1 15 

7 =14 19-17 0 0 —2 45 


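The second and the third rows are proportional:

$$\begin{bmatrix} 1 & -2 & 3 & -4 \\ 0 & 0 & -1 & 15 \\ 0 & 0 & -2 & 45 \end{bmatrix} \xrightarrow{(3)-2(2)} \begin{bmatrix} 1 & -2 & 3 & -4 \\ 0 & 0 & -1 & 15 \\ 0 & 0 & 0 & 15 \end{bmatrix}.$$

The rank of the matrix rk A = 3, the number of variables is equal to four.
Therefore, the system will have 4 - 3 = 1 free variable. Write the resulting
equations:

$$\begin{cases} x_1 - 2x_2 + 3x_3 - 4x_4 = 0, \\ -x_3 + 15x_4 = 0, \\ 15x_4 = 0. \end{cases}$$

Therefore

$$\begin{cases} x_1 = 2x_2, \\ x_3 = 0, \\ x_4 = 0. \end{cases}$$

The fundamental system of solutions has the form $\{[2, 1, 0, 0]^T\}$.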
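A fundamental system of solutions can always be checked by substituting its vectors into the reduced equations. The following minimal sketch does this for part (3) of this problem, assuming the reduced equations x1 + 2x2 + 4x3 - 3x4 = 0 and -x2 - 6x3 + 5x4 = 0; the helper name `residuals` is an illustrative assumption.

```python
# Rows of the reduced homogeneous system from part (3).
ROWS = [[1, 2, 4, -3],
        [0, -1, -6, 5]]

# Candidate fundamental system of solutions.
BASIS = [[8, -6, 1, 0],
         [-7, 5, 0, 1]]


def residuals(rows, x):
    """Left-hand sides of the equations evaluated at the vector x."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in rows]


for vector in BASIS:
    print(vector, residuals(ROWS, vector))

# Any linear combination of the basis vectors must also be a solution.
combo = [2 * u - 3 * v for u, v in zip(*BASIS)]
print(combo, residuals(ROWS, combo))
```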


Chapter 4
Complex Numbers and Matrices


As was already mentioned in Chap. 1, complex numbers may appear as matrix 
elements. Moreover, the characteristics of real matrices (such as eigenvalues, see 
Chap. 5 “Vector Spaces” on page 226) in some cases appear to be complex. In this 
connection, let us discuss the methods of algebra of complex numbers. 

A complex number $z$ is an ordered pair of real numbers $(a, b)$, where $a, b \in \mathbb{R}$.
The first number $a$ is called the real part of the complex number $z = (a, b)$ and
is denoted by the symbol $\operatorname{Re} z$, while the second number of the pair, $b$, is called the
imaginary part of $z$ and is denoted by $\operatorname{Im} z$ [24].

A complex number of the form (a,0), where the imaginary part is zero, is 
identified with the real number a, i.e. (a, 0) = a. This allows considering the set of 
all real numbers R as a subset of a set of complex numbers C. 

Two complex numbers $z_1 = (a_1, b_1)$ and $z_2 = (a_2, b_2)$ are considered equal if
and only if their real and imaginary parts are pairwise equal: $z_1 = z_2 \Leftrightarrow a_1 = a_2$,
$b_1 = b_2$.


4.1 Arithmetic Operations with Complex Numbers 


The operations of addition and multiplication of complex numbers are defined on
the set $\mathbb{C}$. The sum of the complex numbers $z_1 = (a_1, b_1)$ and $z_2 = (a_2, b_2)$ is the complex
number $z = z_1 + z_2 = (a_1 + a_2, b_1 + b_2)$. The product of the numbers $z_1 = (a_1, b_1)$
and $z_2 = (a_2, b_2)$ is the complex number $z = (a, b)$ such that $a = a_1a_2 - b_1b_2$,
$b = a_1b_2 + a_2b_1$.

The pair $(0, 1)$ is of the greatest importance in the operations with complex
numbers; it is denoted by $i = (0, 1)$ and is called the imaginary unit. The basic
property of the imaginary unit is that $i^2 = i \cdot i = (0, 1) \cdot (0, 1) = (-1, 0)$,
or $i^2 = -1$.

A complex number of the form $z = (0, b)$ is called purely imaginary. Since
$(0, b) = (b, 0) \cdot (0, 1)$, the purely imaginary number $z$ can be written in the
form of the product $z = bi$.

Any complex number can be presented in the form

$$z = (a, b) = (a, 0) + (0, b) = (a, 0) + (b, 0) \cdot (0, 1) = a + ib.$$

Such a notation is referred to as the algebraic form of a complex number. This
allows considering $i$ as a factor whose square is equal to $-1$, and performing
operations with complex numbers in the same manner as with algebraic polynomials,
assuming $i^2 = -1$ in intermediate calculations.


Example 4.1 Let $z_1 = 2 + 5i$, $z_2 = -3 + 2i$. Then, the addition of these numbers
will result in the complex number

$$z_1 + z_2 = (2 + 5i) + (-3 + 2i) = (2 - 3) + (5 + 2)i = -1 + 7i. \tag{4.1}$$

The product of the numbers $z_1$ and $z_2$ is computed by multiplying the expressions
$(2 + 5i)$ and $(-3 + 2i)$ as polynomials with regard to the equality $i^2 = -1$:

$$z_1z_2 = (2 + 5i)(-3 + 2i) = 2(-3) + 2(2i) - 3(5i) + (5i)(2i) = -6 + 4i - 15i + 10i^2 = -6 - 10 + (4 - 15)i = -16 - 11i. \tag{4.2}$$
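The arithmetic of this example is easy to reproduce in code. The sketch below uses Python's built-in `complex` type and, separately, the ordered-pair definition of the product given earlier; the helper name `multiply_pairs` is an illustrative assumption.

```python
z1 = complex(2, 5)
z2 = complex(-3, 2)

total = z1 + z2      # sum of the two numbers
product = z1 * z2    # product of the two numbers


def multiply_pairs(p, q):
    """Product of complex numbers given as ordered pairs (a, b)."""
    a1, b1 = p
    a2, b2 = q
    return (a1 * a2 - b1 * b2, a1 * b2 + a2 * b1)


pair_product = multiply_pairs((2, 5), (-3, 2))
print(total, product, pair_product)
```

Both routes give the same real and imaginary parts, which illustrates that the algebraic form and the pair definition describe one and the same operation.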


A complex number $z^* = (a, -b) = a - ib$ is called the conjugate of the complex
number $z = (a, b) = a + ib$. Another frequently used notation for the conjugate is $\bar{z}$.
If the coefficients of the polynomial $p(z)$ are real, the equality $(p(z))^* = p(z^*)$ is
valid.


It is convenient to present the number $z = a + ib$ as the point $(x, y)$ of a plane
with Cartesian coordinates $x = a$ and $y = b$. Associate each complex number $z$
with the point with coordinates $(x, y)$ (and with the position vector connecting the
origin of coordinates with this point). Such a plane is denoted by $\mathbb{C}$ and is referred to
as the complex plane (see Fig. 4.1). Note that the geometric interpretation of complex
numbers is sometimes referred to as the Argand¹ diagram.

Many applications widely use a trigonometric form of the complex number z. 
Let us introduce the polar coordinate system so that the pole is at the origin of 
Cartesian system (x, y). The axis of the polar system will be directed along the 
positive direction of the axis Ox. 


¹ Jean-Robert Argand (1768-1822), French mathematician.

Fig. 4.1 Representation of the number z on a complex plane


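In this case, the Cartesian and polar coordinates of an arbitrary point other than the
origin of coordinates are related by the formulae

$$x = \rho\cos\varphi, \quad y = \rho\sin\varphi,$$

$$\rho = \sqrt{x^2 + y^2}, \qquad \varphi = \begin{cases} \arctan\dfrac{y}{x}, & \text{if } x > 0; \\ \arctan\dfrac{y}{x} + \pi, & \text{if } x < 0,\ y \geq 0; \\ \arctan\dfrac{y}{x} - \pi, & \text{if } x < 0,\ y < 0; \\ \dfrac{\pi}{2}\operatorname{sign} y, & \text{if } x = 0. \end{cases}$$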
As a result, we obtain the trigonometric form of the number $z$:

$$z = (x, y) = x + iy = \rho(\cos\varphi + i\sin\varphi).$$

The value $\rho$ is called the modulus, and $\varphi$ the argument, of the complex number $z$; they are
denoted by $\rho = |z|$, $\varphi = \arg z$. It should be noted that the argument $\varphi$ is not uniquely
determined: instead of the value $\varphi$ we can take the value $\varphi + 2\pi k$, where $k \in \mathbb{Z}$. If
$\arg z$ is chosen in such a way that $-\pi < \arg z \leq \pi$, then such a value is called the
principal value of the argument.
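The piecewise formula for the argument translates directly into code. The sketch below is an illustration only: the function name `to_polar` is hypothetical, the case x < 0, y = 0 is assigned the value pi (consistent with the principal value), and the result is compared with the standard-library function `cmath.polar`.

```python
import cmath
import math


def to_polar(x, y):
    """Modulus and principal argument of x + iy, following the case analysis."""
    rho = math.hypot(x, y)
    if x > 0:
        phi = math.atan(y / x)
    elif x < 0 and y >= 0:
        phi = math.atan(y / x) + math.pi
    elif x < 0:
        phi = math.atan(y / x) - math.pi
    else:
        if y == 0:
            raise ValueError("the argument of zero is undefined")
        phi = math.copysign(math.pi / 2, y)
    return rho, phi


for point in [(1, 1), (-1, 1), (-1, -1), (0, 2), (0, -3), (-2, 0)]:
    print(point, to_polar(*point), cmath.polar(complex(*point)))
```

In practice `math.atan2(y, x)` or `cmath.phase` performs the same case analysis internally.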

For the numbers $z_1 = \rho_1(\cos\varphi_1 + i\sin\varphi_1)$ and $z_2 = \rho_2(\cos\varphi_2 + i\sin\varphi_2)$,
specified in the trigonometric form, the following relations hold:

$$z_1z_2 = \rho_1\rho_2\left(\cos(\varphi_1 + \varphi_2) + i\sin(\varphi_1 + \varphi_2)\right),$$

$$\frac{z_1}{z_2} = \frac{\rho_1}{\rho_2}\left(\cos(\varphi_1 - \varphi_2) + i\sin(\varphi_1 - \varphi_2)\right), \quad \rho_2 \neq 0.$$

Fig. 4.2 Sum (panel a) and product (panel b) of the complex numbers z1 and z2


A geometric illustration of the sum and the product of complex numbers is shown
in Fig. 4.2. For any $z_1, z_2 \in \mathbb{C}$, the position vector of the sum $z_1 + z_2$ is equal to
the sum of the position vectors of the summands $z_1$ and $z_2$. The position vector of
the product $z_1z_2$ is obtained by rotating the position vector of the number $z_1$
counter-clockwise by the angle $\arg z_2$ and stretching it $|z_2|$ times.

Euler's² formula relates the exponential function of an imaginary argument
with the trigonometric functions of the imaginary part of the argument:

$$e^{i\varphi} = \cos\varphi + i\sin\varphi.$$


This is why we can introduce one more notation of the complex number, namely,
the exponential one: $z = \rho e^{i\varphi}$. The exponential notation is convenient for the operations of
multiplication, division, raising to a power and extraction of a root. For example, the
n-th power of the number $z$ can be presented in the form

$$z^n = (\rho e^{i\varphi})^n = \rho^n e^{in\varphi} = \rho^n(\cos n\varphi + i\sin n\varphi)$$

for all integer values of $n$.
An important consequence of the obtained formula,

$$(\cos\varphi + i\sin\varphi)^n = \cos n\varphi + i\sin n\varphi,$$

is associated with the name of de Moivre³.


² Leonhard Euler (1707-1783), prominent Swiss mathematician.
³ Abraham de Moivre (1667-1754), English mathematician of French origin.

The n-th root of $z = \rho(\cos\varphi + i\sin\varphi)$ can be calculated as

$$\sqrt[n]{z} = z^{1/n} = \left[\rho\left(\cos(\varphi + 2\pi k) + i\sin(\varphi + 2\pi k)\right)\right]^{1/n}, \quad k \in \mathbb{Z},$$

or, after applying Euler's formula,

$$\sqrt[n]{z} = \rho^{1/n}\left(\cos\frac{\varphi + 2\pi k}{n} + i\sin\frac{\varphi + 2\pi k}{n}\right), \quad k = 0, 1, \ldots, n - 1.$$

Here we obtain $n$ possible values of the n-th root for $k = 0, 1, \ldots, n - 1$. Other
admissible values of $k$ do not result in new values of $\sqrt[n]{z}$. For example, for $k = n$ the argument
of the root is $\varphi/n + 2\pi$ and differs from the case $k = 0$ by $2\pi$, which corresponds to
the same complex number.
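The root formula can be illustrated with a few lines of code. The following is a minimal sketch, with the hypothetical function name `nth_roots`, that enumerates the n values for k = 0, 1, ..., n - 1 using the standard-library functions `cmath.polar` and `cmath.rect`.

```python
import cmath
import math


def nth_roots(z, n):
    """All n values of the n-th root of a non-zero complex number z."""
    rho, phi = cmath.polar(z)      # modulus and principal argument
    radius = rho ** (1.0 / n)      # arithmetic root of the modulus
    return [cmath.rect(radius, (phi + 2 * math.pi * k) / n)
            for k in range(n)]


z = complex(-8, 0)
for w in nth_roots(z, 3):
    # Each root raised to the third power should reproduce z.
    print(w, w ** 3)
```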


Example 4.2 Denote the roots of the equation $z^n = 1$, where $n$ is a natural number, by
$\omega_k$, $k = 0, \ldots, n - 1$. Prove that on the complex plane the points corresponding to
the values $\omega_k$ are located at the vertices of a regular n-gon inscribed in the unit circle
whose centre is located at the origin of coordinates [24].

Proof According to the introduced definition,

$$\omega_k = e^{2\pi i k/n}, \quad k = 0, 1, \ldots, n - 1. \tag{4.3}$$

In particular, for $n = 4$ we have the following values:

$$\omega_k = e^{2\pi i k/4} = e^{i\pi k/2}, \quad k \in \{0, 1, 2, 3\}, \tag{4.4}$$

or, after computing the complex exponents:

$$\omega_k = 1,\ i,\ -1,\ -i. \tag{4.5}$$

The modulus of the complex number $\omega_k = e^{2\pi i k/n}$ is equal to one for all values of
the variable $k$, and the argument is equal to $\arg\omega_k = 2\pi k/n$, $k = 0, 1, \ldots, n - 1$.
Thereby, we can conclude that the n-th roots of one are located on the unit circle,
the first root $\omega_0$, associated with $k = 0$, lies on the real axis, and the roots $\omega_k$ divide
the circle into $n$ arcs of the same length (see the example for $n = 9$ in
Fig. 4.3).


Fig. 4.3 Location of the
n-th roots of one
on the unit circle for n = 9
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4.2 Fundamental Theorem of Algebra 


Theorem 4.1 (Fundamental Theorem of Algebra) Any polynomial of
non-zero degree with complex coefficients has a complex root [43].

Consequently, an arbitrary polynomial of non-zero degree with real (or complex)
coefficients always has some root $z \in \mathbb{C}$.


Each polynomial of degree $n$

$$p(z) = c_nz^n + c_{n-1}z^{n-1} + \cdots + c_0, \quad c_i \in \mathbb{C} \text{ for } i = 0, 1, \ldots, n, \quad c_n \neq 0,$$

can be uniquely (up to the order of the factors) expanded into the product

$$p(z) = c_n(z - z_1)^{m_1}(z - z_2)^{m_2} \cdots (z - z_k)^{m_k},$$

where $z_i$ is a root of the polynomial $p(z)$ with multiplicity $m_i$, $1 \leq i \leq k$.

For polynomials of degree lower than five, we can always find the roots
by expressing them through arithmetic operations and radicals, i.e. roots of
arbitrary degree. The method for calculating the roots of a cubic polynomial
was suggested by Cardanus⁴ (see Sect. 4.3 below), and that for a fourth-degree
polynomial by Ferrari⁵. However, there is no general method of this kind for finding the roots
of polynomials of higher degrees, according to the next theorem:

Theorem 4.2 (Abel⁶-Ruffini⁷ Theorem) The general equation of degree n for
n ≥ 5 is unsolvable in radicals.


4.3 Cardano Formula 


In order to determine the roots of the cubic equation

$$az^3 + bz^2 + cz + d = 0, \quad \text{where } a, b, c, d \in \mathbb{C}, \tag{4.6}$$

proceed as follows. Using the change of the variable $z = y - \dfrac{b}{3a}$, the equation is
brought to the canonical form

$$y^3 + py + q = 0, \quad p, q \in \mathbb{C}. \tag{4.7}$$


⁴ Hieronymus Cardanus (1501-1576), Italian mathematician and philosopher.
⁵ Lodovico Ferrari (1522-1565), Italian mathematician.
⁶ Niels Henrik Abel (1802-1829), Norwegian mathematician.
⁷ Paolo Ruffini (1765-1822), Italian mathematician.

By the Cardano formula, the roots $y_1, y_2, y_3$ of the cubic equation in the canonical
form are equal to [33]

$$y_1 = \alpha + \beta, \tag{4.8}$$

$$y_2 = -\frac{\alpha + \beta}{2} + i\sqrt{3}\,\frac{\alpha - \beta}{2}, \tag{4.9}$$

$$y_3 = -\frac{\alpha + \beta}{2} - i\sqrt{3}\,\frac{\alpha - \beta}{2}, \tag{4.10}$$

where

$$\alpha = \sqrt[3]{-\frac{q}{2} + \sqrt{Q}}, \quad \beta = \sqrt[3]{-\frac{q}{2} - \sqrt{Q}}, \quad Q = \left(\frac{p}{3}\right)^3 + \left(\frac{q}{2}\right)^2.$$

Using these relations, one should, for each of the three values of the cube root
$\alpha$, take that value of the root $\beta$ for which the equality $\alpha\beta = -p/3$ is valid.
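Example 4.3 Find the roots of the equation $z^3 - 5z^2 + 9z - 5 = 0$ using the Cardano
formula.

Solution Replace the variable $z = y + \dfrac{5}{3}$. We obtain the cubic equation in the
canonical form

$$y^3 + \frac{2}{3}y + \frac{20}{27} = 0, \tag{4.11}$$

here $p = \dfrac{2}{3}$, $q = \dfrac{20}{27}$. Further, using the Cardano formula (4.8)-(4.10):

$$\alpha^3 = -\frac{q}{2} + \sqrt{Q} = \frac{1}{27}\left(-10 + 6\sqrt{3}\right).$$

Let $\alpha = \dfrac{1}{3}\sqrt[3]{6\sqrt{3} - 10}$; then, in order for the condition $\alpha\beta = -p/3$ to be
satisfied, we choose $\beta = -\dfrac{1}{3}\sqrt[3]{6\sqrt{3} + 10}$. The roots of the equation will have the
form

$$y_1 = \frac{1}{3}\left(\sqrt[3]{6\sqrt{3} - 10} - \sqrt[3]{6\sqrt{3} + 10}\right),$$

$$y_{2,3} = -\frac{1}{6}\left(\sqrt[3]{6\sqrt{3} - 10} - \sqrt[3]{6\sqrt{3} + 10}\right) \pm \frac{i\sqrt{3}}{6}\left(\sqrt[3]{6\sqrt{3} - 10} + \sqrt[3]{6\sqrt{3} + 10}\right).$$

The obtained expressions can be simplified if we note that the equality
$6\sqrt{3} \pm 10 = (\sqrt{3} \pm 1)^3$ is valid. Then $\sqrt[3]{6\sqrt{3} \pm 10} = \sqrt{3} \pm 1$, and

$$y_1 = \frac{1}{3}\left(\sqrt{3} - 1 - (\sqrt{3} + 1)\right) = -\frac{2}{3},$$

$$y_2 = -\frac{1}{6}\left(\sqrt{3} - 1 - (\sqrt{3} + 1)\right) + \frac{i\sqrt{3}}{6}\left(\sqrt{3} - 1 + \sqrt{3} + 1\right) = \frac{1}{3} + i,$$

$$y_3 = -\frac{1}{6}\left(\sqrt{3} - 1 - (\sqrt{3} + 1)\right) - \frac{i\sqrt{3}}{6}\left(\sqrt{3} - 1 + \sqrt{3} + 1\right) = \frac{1}{3} - i.$$

Returning to the original variable $z = y + \dfrac{5}{3}$, we obtain $z_1 = 1$, $z_2 = 2 + i$,
$z_3 = 2 - i$.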
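The steps of this section can be collected into a short numerical routine. The sketch below is an illustration under stated assumptions rather than a definitive implementation: the function name `cardano_roots` is hypothetical, a non-zero leading coefficient is assumed, the principal complex cube root is taken for alpha, and beta is then fixed by the condition alpha * beta = -p/3.

```python
import cmath


def cardano_roots(a, b, c, d):
    """Roots of a*z^3 + b*z^2 + c*z + d = 0 (a != 0) by the Cardano formula."""
    # Coefficients of the canonical form y^3 + p*y + q = 0, z = y - b/(3a).
    p = (3 * a * c - b * b) / (3 * a * a)
    q = (2 * b ** 3 - 9 * a * b * c + 27 * a * a * d) / (27 * a ** 3)
    sqrt_q = cmath.sqrt((p / 3) ** 3 + (q / 2) ** 2)

    alpha_cubed = -q / 2 + sqrt_q
    if abs(alpha_cubed) < 1e-14:          # avoid alpha = 0 when possible
        alpha_cubed = -q / 2 - sqrt_q

    if abs(alpha_cubed) < 1e-14:          # p = q = 0: triple root y = 0
        ys = [0j, 0j, 0j]
    else:
        alpha = alpha_cubed ** (1 / 3)    # principal cube root
        beta = -p / (3 * alpha)           # enforces alpha * beta = -p/3
        s, t = alpha + beta, alpha - beta
        half = 1j * cmath.sqrt(3) * t / 2
        ys = [s, -s / 2 + half, -s / 2 - half]

    shift = b / (3 * a)
    return [y - shift for y in ys]


print(cardano_roots(1, -5, 9, -5))
```

For the equation of Example 4.3 the printed values agree, up to rounding, with the roots 1, 2 + i and 2 - i.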


4.4 Complex Coefficient Matrices 


Among the complex coefficient matrices, classes of Hermitian and unitary matrices 
play a special role in algebra and its applications. 


4.4.1 Hermitian Matrices 


Consider the matrix $Z = (z_{ij})$ with complex elements, where $i = 1, 2, \ldots, m$,
$j = 1, 2, \ldots, n$. The Hermitian conjugate matrix relative to $Z$ is the
matrix $Z^H$ whose elements are equal to

$$(Z^H)_{ij} = z_{ji}^*. \tag{4.12}$$

In order to obtain the Hermitian conjugate matrix, the operations of transposition and
complex conjugation are applied to the initial matrix. These operations are
independent and can be executed in any order.


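Example 4.4 The Hermitian conjugate matrix relative to $Z = \begin{bmatrix} 1+i & 2+3i \\ -1 & 5-4i \end{bmatrix}$ is the
matrix

$$Z^H = \left(\begin{bmatrix} 1+i & 2+3i \\ -1 & 5-4i \end{bmatrix}^T\right)^* = \begin{bmatrix} 1+i & -1 \\ 2+3i & 5-4i \end{bmatrix}^* = \begin{bmatrix} 1-i & -1 \\ 2-3i & 5+4i \end{bmatrix}. \tag{4.13}$$

Note Often, the notations $Z^*$ or $Z^\dagger$ are used to designate the Hermitian conjugate matrix.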


Let us enumerate the main properties of the Hermitian conjugation operation:

1. $(Z_1 + Z_2)^H = Z_1^H + Z_2^H$;
2. $(\lambda Z)^H = \lambda^* Z^H \quad \forall \lambda \in \mathbb{C}$;
3. $(Z^H)^H = Z$;
4. if $A^{-1}$ exists, then $(A^{-1})^H = (A^H)^{-1}$;
5. $\det A^H = \det A^* = (\det A)^*$.


Theorem 4.3 If for the complex matrices $Z_1$ and $Z_2$ the product $Z_1Z_2$ is defined,
then

$$(Z_1Z_2)^H = Z_2^H Z_1^H. \tag{4.14}$$

Proof The validity of the theorem follows from the property of the matrix product
transposition (see Problem 1.3):

$$(Z_1Z_2)^T = Z_2^T Z_1^T. \tag{4.15}$$

With the help of Eq. (4.15) we obtain a chain of equalities

$$(Z_1Z_2)^H = \left((Z_1Z_2)^T\right)^* = \left(Z_2^T Z_1^T\right)^* = \left(Z_2^T\right)^*\left(Z_1^T\right)^* = Z_2^H Z_1^H, \tag{4.16}$$

which proves the identity (4.14).

Among complex matrices, Hermitian matrices are very widely used. A Hermitian
matrix is a square matrix for which $Z^H = Z$. The respective condition for the elements
of such a matrix is $\forall i, j\ (z_{ij} = z_{ji}^*)$. In other words, a Hermitian matrix coincides
with its Hermitian conjugate.

Example 4.5 The matrix $Z = \begin{bmatrix} 1 & -3-i \\ -3+i & 1 \end{bmatrix}$ is Hermitian, as is easy to see.
Let us verify it:

$$Z^H = \left(\begin{bmatrix} 1 & -3-i \\ -3+i & 1 \end{bmatrix}^T\right)^* = \begin{bmatrix} 1 & -3+i \\ -3-i & 1 \end{bmatrix}^* = \begin{bmatrix} 1 & -3-i \\ -3+i & 1 \end{bmatrix} = Z. \tag{4.17}$$


Recall that applying the complex conjugation operation to a real number does 
not change this number. 


Note Hermitian matrices are also referred to as self-adjoint matrices. The theory of 
self-adjoint matrices is widely used in modern physics [44]. 


4.4.2 Unitary Matrices 


A square matrix $U$ with complex elements is called unitary if the condition
$U^H U = I$ is met. The condition of unitarity can be written in other equivalent
forms as follows:

$$UU^H = I \quad \text{or} \quad U^H = U^{-1}. \tag{4.18}$$


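Example 4.6 Prove that the matrix $Z = \dfrac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ -i & i \end{bmatrix}$ is unitary. To do this, compute
the product $Z^H Z$:

$$Z^H Z = \left(\left(\frac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ -i & i \end{bmatrix}\right)^T\right)^* \frac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ -i & i \end{bmatrix} = \frac{1}{\sqrt{2}}\begin{bmatrix} 1 & i \\ 1 & -i \end{bmatrix} \cdot \frac{1}{\sqrt{2}}\begin{bmatrix} 1 & 1 \\ -i & i \end{bmatrix} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}. \tag{4.19}$$

Therefore, $Z$ is a unitary matrix.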


Theorem 4.4 The determinant of a unitary matrix is a complex number whose 
modulus is equal to one. 


Proof See in Problem 4.51. 
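The unitarity condition and the statement of Theorem 4.4 can be checked numerically for a concrete matrix. The sketch below assumes the matrix of Example 4.6 is read as (1/sqrt 2) [[1, 1], [-i, i]]; the helper names `hermitian_conjugate` and `matmul` are illustrative, and the check is a numerical illustration, not a proof.

```python
import math


def hermitian_conjugate(Z):
    """Transpose the matrix and conjugate every element."""
    return [[Z[i][j].conjugate() for i in range(len(Z))]
            for j in range(len(Z[0]))]


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


s = 1 / math.sqrt(2)
Z = [[s, s],
     [-1j * s, 1j * s]]

P = matmul(hermitian_conjugate(Z), Z)   # should be close to the identity
det = Z[0][0] * Z[1][1] - Z[0][1] * Z[1][0]

print(P)
print(abs(det))                         # modulus of the determinant
```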


There is a close connection between unitary and Hermitian matrices: each unitary 
matrix A is presented in the form A = exp(iB), where B is a Hermitian matrix [26]. 
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4.5 Fundamentals of Quantum Computing 


In quantum computers, computation is implemented by means of processes of a quantum
nature, which manifest themselves in experiments with objects of the microcosm:
elementary particles, atoms, molecules, molecular clusters, etc. The description
of such processes is based on the application of complex numbers and complex
matrices.

As is well known, the basic notion of classical information theory is the bit [18]. A
classical bit takes the values 0 or 1 (and no other).

A qubit (quantum bit) is the smallest element that executes the information storage
function in a quantum computer [6, 54, 75].

A qubit is a quantum system $|\psi\rangle$ that allows two states: $|0\rangle$ and $|1\rangle$ [54]. In
accordance with the so-called bra-ket (bracket) Dirac⁸ notation, the symbols $|0\rangle$
and $|1\rangle$ are read as "ket 0" and "ket 1", respectively. The brackets $|\ldots\rangle$ show that $\psi$
is some state of the quantum system.

The fundamental difference between the classical bit and the qubit is
that the qubit can be in a state different from $|0\rangle$ or $|1\rangle$. An arbitrary state of the
qubit is defined by a linear combination of the basic states:

$$|\psi\rangle = u|0\rangle + v|1\rangle, \tag{4.20}$$

where the complex coefficients $u$ and $v$ satisfy the following condition:

$$|u|^2 + |v|^2 = 1. \tag{4.21}$$


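The mathematical description of the basic states reduces to their representation
in matrix form:

$$|0\rangle = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad |1\rangle = \begin{bmatrix} 0 \\ 1 \end{bmatrix}. \tag{4.22}$$

Based on the representation (4.22), an arbitrary state of the qubit is written as

$$|\psi\rangle = \begin{bmatrix} u \\ v \end{bmatrix}. \tag{4.23}$$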


⁸ Paul Adrien Maurice Dirac (1902-1984), English physicist.

A system of two qubits is set by a linear combination of the basic states

$$|00\rangle = \begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}, \quad |01\rangle = \begin{bmatrix} 0 \\ 1 \\ 0 \\ 0 \end{bmatrix}, \quad |10\rangle = \begin{bmatrix} 0 \\ 0 \\ 1 \\ 0 \end{bmatrix}, \quad |11\rangle = \begin{bmatrix} 0 \\ 0 \\ 0 \\ 1 \end{bmatrix}. \tag{4.24}$$

The states

$$|00\ldots00\rangle, \quad |00\ldots01\rangle, \quad \ldots, \quad |11\ldots11\rangle \tag{4.25}$$

of several interacting qubits are introduced similarly. Such quantum states are called computational basis
states or, for short, basis states.

For changing the state of a quantum system, quantum operations are used referred 
to as gates (quantum gate). Thus, the gates perform logical operations with qubits. 
Note that the change of the state |y) in time is also referred to as the evolution of 
the quantum system. 

An important step of quantum algorithms is the measurement of the
state. When the qubit state is measured, it randomly passes to one of the states $|0\rangle$
or $|1\rangle$. Therefore, the complex coefficients u and v from the qubit definition (4.20)
are associated with the probability of getting the value 0 or 1 when the state is measured.
According to the postulates of quantum theory, the probabilities of passing to the
states $|0\rangle$ and $|1\rangle$ are equal to $|u|^2$ and $|v|^2$, respectively. In this connection, the
equality (4.21) reflects the probability conservation law. After the measurement, the
qubit passes to the basic state that corresponds to the classical result of the measurement.
Generally speaking, the probabilities of getting the results 0 and 1 are different for
different states of the quantum system.
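
The measurement rule can be imitated on a classical computer with a pseudo-random generator. The following sketch is only a simulation of the probabilities $|u|^2$ and $|v|^2$; the function names are hypothetical.

```python
import random

def measurement_probabilities(u: complex, v: complex):
    """Probabilities of the results 0 and 1 for the state u|0> + v|1>."""
    return abs(u) ** 2, abs(v) ** 2

def measure(u: complex, v: complex, rng=random) -> int:
    """Return 0 with probability |u|^2 and 1 otherwise."""
    p0, _ = measurement_probabilities(u, v)
    return 0 if rng.random() < p0 else 1

# Frequency of the result 0 over many simulated measurements of the same state.
trials = 10_000
zeros = sum(1 for _ in range(trials) if measure(0.6, 0.8j) == 0)
```

For u = 0.6, v = 0.8i the fraction `zeros / trials` should be close to 0.36, although the exact value differs from run to run.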

In other words, quantum computing is a sequence of simple operations
on a collection of interacting qubits. In the final step of the quantum
computing procedure, the state of the quantum system is measured and a conclusion
about the result of the computation is made. The measurement makes it possible to obtain,
at a macroscopic level, information about the quantum state. A peculiarity of
quantum measurements is their irreversibility, which radically distinguishes
quantum computing from classical computing.

A quantum system formed by N two-level elements has $X(N) = 2^N$
independent states. The key point in the functioning of such a system is the interaction
of separate qubits with each other. The number of states X(N) grows exponentially
with the size of the quantum system, which allows solving practical problems of
very high asymptotic complexity (see section “Estimation of Algorithm Efficiency”
on page 13). For example, an efficient quantum algorithm for prime factorization
is known, which is very important for cryptography. As a result, quantum
algorithms provide exponential or polynomial acceleration in comparison with
classical solution methods for many problems.

Unfortunately, no full-function quantum computer has been created yet, although
many of its elements have already been built and studied at the world's leading
laboratories [72]. The main obstacle to the development of quantum computing
is the instability of a system of many qubits. The more qubits are united into an
entangled system, the more effort is required to keep the number of
measurement errors small. Nevertheless, the history of quantum computer development
demonstrates the enormous potential of uniting quantum theory and
algorithm theory.

Before proceeding to the basic quantum operations with qubits, let
us introduce the notion of the Pauli⁹ matrices and the Dirac matrices.


4.5.1 Pauli Matrices and Dirac Matrices 
The matrices $\sigma_1$, $\sigma_2$ and $\sigma_3$

$$\sigma_1 = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad
\sigma_2 = \begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}, \quad
\sigma_3 = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} \tag{4.26}$$

are called the Pauli matrices. They are widely used in quantum theory for
describing half-integer spin particles, for example, an electron. (Spin is a quantum
property of an elementary particle, its intrinsic angular momentum [44]. So, electrons,
protons and neutrinos have half-integer spin; the spin of photons and gravitons is
integer.)

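The following properties are valid for the Pauli matrices.

1. The Pauli matrices are Hermitian and unitary:

$$\forall k \in \{1, 2, 3\} \quad \sigma_k = \sigma_k^H = \sigma_k^{-1}. \tag{4.27}$$

2. $\forall k \in \{1, 2, 3\}$ the square of the Pauli matrix is equal to the identity matrix:

$$\sigma_k^2 = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}. \tag{4.28}$$

3. $\forall i, j \in \{1, 2, 3\}$ the equalities are valid

$$\sigma_i \sigma_j + \sigma_j \sigma_i = 2\delta_{ij} \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}. \tag{4.29}$$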
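
Properties (4.27)-(4.29) are easy to verify numerically. The sketch below stores the matrices as nested lists and uses hypothetical helper names (`matmul`, `dagger`, `anticommutator`); it is a check of the stated properties, not a proof.

```python
# Pauli matrices (4.26) as lists of rows; 1j is the imaginary unit.
I2 = [[1, 0], [0, 1]]
SIGMA = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
}

def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def dagger(a):
    """Hermitian conjugate: transposition plus complex conjugation."""
    return [[complex(a[j][i]).conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]

def anticommutator(a, b):
    """The matrix ab + ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(len(ab))]
            for i in range(len(ab))]

for s in SIGMA.values():
    assert dagger(s) == s          # Hermitian, see (4.27)
    assert matmul(s, s) == I2      # (4.28); with (4.27) this gives unitarity
for i in SIGMA:
    for j in SIGMA:
        d = 2 if i == j else 0     # 2 * delta_ij
        assert anticommutator(SIGMA[i], SIGMA[j]) == [[d, 0], [0, d]]  # (4.29)
```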


⁹ Wolfgang Ernst Pauli (1900-1958), Swiss and American physicist.

Sometimes, in linear algebra and its applications, one has to use matrices split
into rectangular parts, or blocks [25, 47]. Consider the rectangular matrix $A = (a_{ij})$,
where $1 \le i \le m$, $1 \le j \le n$. Let $m = m_1 + m_2$, $n = n_1 + n_2$.

Let us draw horizontal and vertical lines and split the matrix A into four
rectangular blocks:

$$A = \begin{bmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{bmatrix}. \tag{4.30}$$

Thus, the matrix A is presented in the form of a block matrix consisting of the
blocks $B_{11}$, $B_{12}$, $B_{21}$, $B_{22}$ of size $m_1 \times n_1$, $m_1 \times n_2$, $m_2 \times n_1$, $m_2 \times n_2$, respectively.

As an example of specifying a block matrix, we provide the definition of the Dirac
matrices. The four Dirac matrices $\alpha_1$, $\alpha_2$, $\alpha_3$, $\beta$ enter the Dirac equation [4]
for a relativistic particle with half-integer spin, and are expressed in terms of the
Pauli matrices $\sigma_k$, k = 1, 2, 3, as follows:

$$\alpha_k = \begin{bmatrix} O & \sigma_k \\ \sigma_k & O \end{bmatrix}, \quad
\beta = \begin{bmatrix} I & O \\ O & -I \end{bmatrix}, \tag{4.31}$$

where O is the zero matrix of size 2 × 2 and I is the identity matrix of the same size.
(Relativistic particles are particles whose velocity is close to the velocity of light.)

Each of the Dirac matrices is Hermitian and unitary.
Moreover, for all $l, m \in \{1, 2, 3\}$ the equalities are valid:

$$\alpha_l \alpha_m + \alpha_m \alpha_l = 2\delta_{lm} I, \tag{4.32}$$
$$\alpha_l \beta + \beta \alpha_l = O. \tag{4.33}$$

Note that the size of the matrices I and O in formulae (4.32) and (4.33) is
4 × 4.
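
The block definition (4.31) translates directly into code: a 4 × 4 matrix is assembled from four 2 × 2 blocks. The sketch below uses hypothetical helper names (`block`, `matmul`, `madd`) and checks the relations (4.32) and (4.33) numerically.

```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def madd(a, b):
    """Elementwise sum of two matrices of equal size."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def block(b11, b12, b21, b22):
    """Assemble the block matrix [[B11, B12], [B21, B22]]."""
    top = [r1 + r2 for r1, r2 in zip(b11, b12)]
    bottom = [r1 + r2 for r1, r2 in zip(b21, b22)]
    return top + bottom

O2 = [[0, 0], [0, 0]]
I2 = [[1, 0], [0, 1]]
NEG_I2 = [[-1, 0], [0, -1]]
SIGMA = {1: [[0, 1], [1, 0]], 2: [[0, -1j], [1j, 0]], 3: [[1, 0], [0, -1]]}

ALPHA = {k: block(O2, s, s, O2) for k, s in SIGMA.items()}   # (4.31)
BETA = block(I2, O2, O2, NEG_I2)                              # (4.31)

I4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
O4 = [[0] * 4 for _ in range(4)]

for l in ALPHA:
    # (4.33): every alpha_l anticommutes with beta
    assert madd(matmul(ALPHA[l], BETA), matmul(BETA, ALPHA[l])) == O4
    for m in ALPHA:
        # (4.32): alpha_l alpha_m + alpha_m alpha_l = 2 delta_lm I
        expected = [[2 * x if l == m else 0 for x in row] for row in I4]
        assert madd(matmul(ALPHA[l], ALPHA[m]), matmul(ALPHA[m], ALPHA[l])) == expected
```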


4.5.2 Basic Operations with Qubits 


Consider the basic operations with qubits.
The action of a quantum gate on the qubit $|\psi\rangle$ is performed by applying a
quantum-mechanical operator, for example, $U|\psi\rangle$ [67, 72]. The operators may
be presented in the form of unitary matrices. In particular, the evolution of a single
qubit is described by a unitary matrix of size 2 × 2.

Successive application of the series of operators $U_1, U_2, \ldots, U_n$ to one qubit
is equivalent to the action of the operator W, whose matrix is the product of the
matrices $U_n \ldots U_2 U_1$ [54]:

$$U_n(U_{n-1}(\ldots(U_2(U_1 |\psi\rangle))\ldots)) = (U_n \ldots U_2 U_1)\,|\psi\rangle = W|\psi\rangle. \tag{4.34}$$

Such an operator W is called a composition of the operators $U_1, U_2, \ldots, U_n$. Because
matrix multiplication is non-commutative in the general case, the order in which the
quantum gates are applied matters: the matrix of the gate applied first stands
rightmost in the product.
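
The role of the order of the gates can be seen on a small numerical example. In the sketch below (helper names are hypothetical) the gate applied first is the rightmost factor of the matrix product, and swapping two gates changes the resulting operator.

```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def matvec(m, v):
    """Product of a matrix and a column vector."""
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]

def compose(*gates):
    """Matrix of the gates applied in the listed order: W = U_n ... U_2 U_1."""
    w = gates[0]
    for g in gates[1:]:
        w = matmul(g, w)   # each later gate multiplies from the left
    return w

X = [[0, 1], [1, 0]]       # Pauli matrix sigma_1
Z = [[1, 0], [0, -1]]      # Pauli matrix sigma_3
psi = [0.6, 0.8]

step_by_step = matvec(Z, matvec(X, psi))   # first X, then Z
at_once = matvec(compose(X, Z), psi)       # the same via the composition
```

Here `compose(X, Z)` and `compose(Z, X)` differ by a sign, which illustrates the non-commutativity mentioned above.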


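Example 4.7 Let us show that the application of the operator $\sigma_3$ (see definition (4.26))
to the qubit in the state $|\psi\rangle = u|0\rangle + v|1\rangle$ transfers it to the state

$$|\psi'\rangle = u|0\rangle - v|1\rangle.$$

Proof Write the qubit $|\psi\rangle$ in matrix representation:

$$|\psi\rangle = \begin{bmatrix} u \\ v \end{bmatrix}. \tag{4.35}$$

Define the action of $\sigma_3$ on this quantum state:

$$|\psi'\rangle = \sigma_3 \begin{bmatrix} u \\ v \end{bmatrix}
= \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} \begin{bmatrix} u \\ v \end{bmatrix}
= \begin{bmatrix} u \\ -v \end{bmatrix}
= u \begin{bmatrix} 1 \\ 0 \end{bmatrix} - v \begin{bmatrix} 0 \\ 1 \end{bmatrix}
= u|0\rangle - v|1\rangle.$$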


Graphic representation of quantum operations in the form of schemes or diagrams
(quantum circuits) is widely used.

A quantum-mechanical operator U that transforms a single qubit (a one-qubit
gate) is represented as follows:

$|\psi_{in}\rangle$ --[ U ]-- $|\psi_{out}\rangle$

The sequence of steps of a quantum algorithm corresponds to the direction on the
diagram from left to right.

Table 4.1 lists the gates that transform one qubit, together with the matrix
representations of these gates.

Table 4.1 One qubit operations

Name                      Designation    Matrix
Identity transformation   --[ I ]--      $\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$
Pauli element X           --[ X ]--      $\begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}$
Pauli element Y           --[ Y ]--      $\begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}$
Pauli element Z           --[ Z ]--      $\begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}$
Hadamard element          --[ H ]--      $\frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix}$
Phase element             --[ S ]--      $\begin{bmatrix} 1 & 0 \\ 0 & i \end{bmatrix}$
Element $\pi/8$           --[ T ]--      $\begin{bmatrix} 1 & 0 \\ 0 & e^{i\pi/4} \end{bmatrix}$
Measurement               meter symbol   Projection on $|0\rangle$ and $|1\rangle$

Let us show the method of computing the matrix of a quantum operation based on its
action on the basic vectors.


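The Hadamard¹⁰ gate transforms the system's state by the rule:

$$|0\rangle \to \frac{1}{\sqrt{2}} (|0\rangle + |1\rangle), \tag{4.37}$$
$$|1\rangle \to \frac{1}{\sqrt{2}} (|0\rangle - |1\rangle). \tag{4.38}$$

Therefore, the arbitrary state $|\psi\rangle$ will in this case change as follows:

$$|\psi\rangle = \begin{bmatrix} u \\ v \end{bmatrix}
= u \begin{bmatrix} 1 \\ 0 \end{bmatrix} + v \begin{bmatrix} 0 \\ 1 \end{bmatrix}
\to u\,\frac{1}{\sqrt{2}} (|0\rangle + |1\rangle) + v\,\frac{1}{\sqrt{2}} (|0\rangle - |1\rangle)
= \frac{u + v}{\sqrt{2}}\,|0\rangle + \frac{u - v}{\sqrt{2}}\,|1\rangle
= \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix} \begin{bmatrix} u \\ v \end{bmatrix}. \tag{4.39}$$

Thus, the Hadamard element is presented by the matrix
$\frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix}$.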
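
The method used above amounts to the following observation: the j-th column of the gate matrix is the image of the j-th basic vector. A minimal sketch, with the hypothetical helper `matrix_from_basis_images`:

```python
import math

def matrix_from_basis_images(images):
    """Build a gate matrix from the images of the basis vectors.

    images[j] is the column vector U|j>; it becomes the j-th column of U.
    The result is returned as a list of rows.
    """
    n = len(images)
    return [[images[j][i] for j in range(n)] for i in range(n)]

s = 1 / math.sqrt(2)
# Rules (4.37) and (4.38): |0> -> (|0> + |1>)/sqrt(2), |1> -> (|0> - |1>)/sqrt(2)
H = matrix_from_basis_images([[s, s], [s, -s]])

# The same method for the Pauli element Y: Y|0> = i|1>, Y|1> = -i|0>
Y = matrix_from_basis_images([[0, 1j], [-1j, 0]])
```

The example with Y shows why the images are placed in columns rather than rows: for a non-symmetric matrix the two choices give different results.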


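¹⁰ Jacques Salomon Hadamard (1865-1963), French mathematician.

Table 4.2 Two qubits operations

Name                       Designation        Matrix
Exchange                   (circuit symbol)   $\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}$
Controlled NOT             (circuit symbol)   $\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}$
Controlled phase element   (circuit symbol)   $\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & i \end{bmatrix}$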


Obviously, in order to execute complex algorithms, the qubits must interact with
each other and exchange information. In this connection, the logical operations
that affect two or more qubits are of particular importance. Table 4.2 lists the most
important gates that transform the state of two qubits.


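Example 4.8 Let us determine how the qubit $|\psi\rangle$ is transformed under the action of
two applications of the Hadamard element:

$|\psi\rangle$ --[ H ]--[ H ]-- $|\psi'\rangle$

Solution As shown above, in matrix representation, the Hadamard element is
described by the matrix

$$M_H = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix}. \tag{4.40}$$

Compute the matrix that corresponds to two applications of the Hadamard
element as a matrix product (see formula (4.34)):

$$M_H M_H = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix} \cdot
\frac{1}{\sqrt{2}} \begin{bmatrix} 1 & 1 \\ 1 & -1 \end{bmatrix}
= \frac{1}{2} \begin{bmatrix} 2 & 0 \\ 0 & 2 \end{bmatrix}
= \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}. \tag{4.41}$$

An identity matrix is obtained; therefore, two applications of the Hadamard
element bring the qubit back to its original state.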




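Example 4.9 Find a matrix representation of the following quantum circuit:

[Quantum circuit diagram: two successive CNOT elements acting on two qubits; the second element takes its inputs in the reverse order relative to the first.]

Solution The quantum circuit consists of two “controlled NOT” elements, also
referred to as “CNOT” (Controlled NOT).
The matrix of the CNOT element has the form (see Table 4.2):

$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}. \tag{4.42}$$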


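In view of the matrix representation of CNOT, compute how an arbitrary two-qubit state
$|\psi_1\psi_2\rangle = u_1|00\rangle + v_1|01\rangle + u_2|10\rangle + v_2|11\rangle$, written in the computational
basis (4.24), will change after the action of the first CNOT:

$$|\psi_1\psi_2\rangle = \begin{bmatrix} u_1 \\ v_1 \\ u_2 \\ v_2 \end{bmatrix} \to
\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}
\begin{bmatrix} u_1 \\ v_1 \\ u_2 \\ v_2 \end{bmatrix}
= \begin{bmatrix} u_1 \\ v_1 \\ v_2 \\ u_2 \end{bmatrix}. \tag{4.43}$$

This is equivalent to the fact that the states of the computational basis (4.24) are
transformed in accordance with the rule:

$$|00\rangle \to |00\rangle, \quad |01\rangle \to |01\rangle, \quad |10\rangle \to |11\rangle, \quad |11\rangle \to |10\rangle. \tag{4.44}$$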


Note that the next CNOT element takes the input states in the reverse order
relative to the first element. In this case, the basis state transformation rule has the
form:

$$|00\rangle \to |00\rangle, \quad |01\rangle \to |11\rangle, \quad |10\rangle \to |10\rangle, \quad |11\rangle \to |01\rangle. \tag{4.45}$$

Therefore, the next step of the quantum system evolution is described by the matrix

$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}. \tag{4.46}$$
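We perform the matrix computations:

$$\begin{bmatrix} u_1 \\ v_1 \\ v_2 \\ u_2 \end{bmatrix} \to
\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}
\begin{bmatrix} u_1 \\ v_1 \\ v_2 \\ u_2 \end{bmatrix}
= \begin{bmatrix} u_1 \\ u_2 \\ v_2 \\ v_1 \end{bmatrix}. \tag{4.47}$$

As a result, the original state $|\psi_1\psi_2\rangle = [u_1, v_1, u_2, v_2]^T$ passes to
$[u_1, u_2, v_2, v_1]^T$. Since the first element acts on the state first, its matrix stands
on the right in the product, and the matrix representation of the circuit under analysis can be
written in the form:

$$\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}
\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}
= \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 1 & 0 & 0 \end{bmatrix}. \tag{4.48}$$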
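
The computation of Example 4.9 can be cross-checked by applying the two matrices step by step and comparing with a single product, taken with the matrix of the first element on the right. The names `FIRST`, `SECOND` and the numeric test vector below are illustrative assumptions.

```python
def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def matvec(m, v):
    """Product of a matrix and a column vector."""
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]

FIRST = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]    # matrix (4.42)
SECOND = [[1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0], [0, 1, 0, 0]]   # matrix (4.46)

# The element that acts first is the right-hand factor.
circuit = matmul(SECOND, FIRST)

# Stand-ins for the components u1, v1, u2, v2 (not normalized, labels only).
state = [1, 2, 3, 4]
step_by_step = matvec(SECOND, matvec(FIRST, state))
at_once = matvec(circuit, state)
```

Both ways give the vector with components (u1, u2, v2, v1), in agreement with (4.47).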


Review Questions 


1. What is a complex number?
2. Enumerate arithmetic operations on complex numbers.
3. How is the number conjugate of a given complex number found?
4. How can a complex number be presented geometrically?
5. Explain the differences between the following forms of complex numbers:
algebraic, trigonometric and exponential.
6. Write Euler’s formula.
7. How is the root of the n-th order of a complex number found? How many values
does it take?
8. Formulate the fundamental theorem of algebra.
9. What is the Cardano formula used for?
10. What matrices are called Hermitian? unitary?
11. Define the concepts of “bit” and “qubit”.
12. What states are referred to as the computational basis states?
13. What is a gate?
14. Explain how the quantum state is measured.
15. Write the Pauli and Dirac matrices.
16. Enumerate the basic operations on one qubit.
17. What quantum operations are applied to a system of two qubits?


Problems 


4.1. 


4.2. 


4.3. 


4.4. 


4.5. 


4.6. 


4.7. 


Perform algebraic manipulations and represent the specified complex num- 
ber z in the algebraic form z = Rez +iImz: 


(1) @+i)(2 + 5i); 
(2) 8-iGB+i); 

(3) d-14+04+0)%; 
(4) 2-3. 


Perform algebraic manipulations and represent the specified complex num- 
ber z in the algebraic form z = Rez +iImz: 


dy @=HNE1L+4%); 
(2) (6+ 5i)(4—i); 
(3) (1+ 31)? + (2—i)?; 


(4) P—7. 
Given are the complex numbers $z_1 = 5 + i$, $z_2 = 4 - i$, $z_3 = -1 + 3i$. Find
$z_1 z_3 - z_2^2$.
Given are the complex numbers z; = | — 27, z2 = —1 +i, z3 = —i. Find 


z122(Z3 — Z3). 

Perform the actions:

(1) $(1 + 4i)^3 + (1 - 4i)^3$;

(2) $(6 - i)^4 + (6 + i)^4$.

Having performed the division, represent the complex number $z = \dfrac{a + ib}{c + id}$,
where $a, b, c, d \in \mathbb{R}$, $c \neq 0$, $d \neq 0$, in algebraic form.

Simplify the expressions: 

3 +7. 


3-77 
143i. 


1—3i’ 


(1) 
(2) 
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2+ i 


347° 
i i 


(3) 
(4) 


I+i 1-i 
4.8. Given are the complex numbers $z_1 = 2 + i$, $z_2 = z_1^*$, $z_3 = z_1 + z_2$. Find
$(z_1 - z_3)(z_2 - z_3)/z_2$.
4.9. Given are the complex numbers $z_1 = 7 + 2i$, $z_2 = -z_1$, $z_3 = 2 - z_1^*$. Find
$(z_1 z_2 + z_2 z_3 + z_1 z_3)/z_1$.
4.10. Find the number conjugate with the number z, if: 


pons. 
= 143i) 

no) een 

ca Wag 


4.11. Find z, if $z - 3z^* = 18 + 4i$.
4.12. Find z, if $3z^* - 7z = 10 - 10i$.
4.13. Prove that for arbitrary $z_1, z_2 \in \mathbb{C}$ the equalities are valid:

(1) $(z_1 + z_2)^* = z_1^* + z_2^*$;
(2) $(z_1 z_2)^* = z_1^* z_2^*$.

4.14. Prove that if $|z| = 1$, then $z^{-1} = z^*$.
*4.15. Prove that for any $z_1, z_2 \in \mathbb{C}$ the triangle inequalities are valid:

$$\bigl|\,|z_1| - |z_2|\,\bigr| \le |z_1 + z_2| \le |z_1| + |z_2|.$$


+4.16. Calculate the sums: 
10 
@) ye 
k=1 


49 
Q). yo i. 
k=—49 
4.17. Simplify the expression $i^m$ for arbitrary $m \in \mathbb{Z}$.
4.18. Represent the complex number in a trigonometric form:

(1) $2$;
(2) $3i$;
(3) $4 + 3i$;
(4) $-i$;
(5) $-3 - 6i$;
(6) $\sqrt{2}(1 + i)$;
(7) $\sqrt{3}(-1 + 3i)$;
(8) $\dfrac{9 + i}{9 - i}$.
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4.19. Represent the following complex numbers in algebraic form z = Rez + 
ilmz: 


(1) T Pere Uv 
= cos — sin —3 
: coe 


(2) z=4 (cos(—5) +i sin(—)); 


3 
3) z= —. >; 


10. OTE 
cos — +1 sin — 


be ae Ten. 
(4) z= cos(—-) + i sin(—=). 


4.20. Prove that a complex number of the type $u = \dfrac{a + ib}{a - ib}$, where $a, b \in \mathbb{R}$, can
be presented in the form of an exponent with a purely imaginary index, i.e. in
the form $u = e^{i\delta}$, where $\delta \in \mathbb{R}$.

4.21. Calculate $i^i$.
4.22. Calculate:

(1) $\sqrt{8i}$;
(2) $\sqrt[6]{4096}$.


*4.23. Prove the validity of the identities for the roots of unity $\omega_k$, where $0 \le k \le n - 1$ [24]:

(1) $\omega_{k+n/2} = -\omega_k$ for even n and $0 \le k \le n/2 - 1$;
(2) $\sum_{k=0}^{n-1} \omega_k = 0$ for $n > 1$;
(3) $\prod_{k=0}^{n-1} \omega_k = (-1)^{n-1}$ for all natural values n.

*4.24. Prove the validity of the identities for the roots of unity $\omega_k$, where $0 \le k \le n - 1$,
for all natural values n:

(1) $\prod_{k=0}^{n-1} (z - \omega_k) = z^n - 1$;
(2) $\sum_{k=0}^{n-1} (\omega_k)^d = \begin{cases} 0, & 1 \le d \le n - 1, \\ n, & d = n. \end{cases}$
*4.25. Prove the validity of the identities for the n-th roots of unity wg, where k = 
0,1,...,n —1, for all values n > 2: 


n—2 
(1) SO woes = —@n-1; 
k=0 
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N=2 We_1Ok-+1 

2) y= 1 toi); 
k=1 Wk 
n= 


*4.26. 


+*4.27. 


4.28. 


*4.29, 


+*4.30. 
+431. 


ay 


(3) DE wax = 0; 


k,k’=0 
k<k’ 


n—1 WOK! n 


Khao Ok'-k = 1 — wn 


k<k' 
Let $\omega_k$, where $0 \le k \le n - 1$, be the n-th roots of unity, x an arbitrary
complex number, and $x \neq \omega_k$ for any k. Calculate the sum $\sum_{k=0}^{n-1} \dfrac{\omega_k}{x - \omega_k}$.

Prove that for natural $n \in \mathbb{N}$ and $\alpha \neq 2\pi k$, $k \in \mathbb{Z}$, the Lagrange¹¹
identities [38] are valid:

(a) $\cos\alpha + \cos 2\alpha + \cdots + \cos n\alpha = \dfrac{\sin(n\alpha/2)}{\sin(\alpha/2)} \cos[(n + 1)\alpha/2]$;

(b) $\sin\alpha + \sin 2\alpha + \cdots + \sin n\alpha = \dfrac{\sin(n\alpha/2)}{\sin(\alpha/2)} \sin[(n + 1)\alpha/2]$.


Prove de Moivre’s formula for natural values of the exponent n, using the
mathematical induction method.

Using de Moivre’s formula, express $\cos 3\varphi$ and $\sin 3\varphi$ in terms of $\cos\varphi$ and
$\sin\varphi$.

Express $\cos 4\varphi$ and $\sin 4\varphi$ in terms of $\cos\varphi$ and $\sin\varphi$.

There exist relations that express a polynomial’s coefficients in terms of its roots
(Viète¹² formulae). If $\alpha_1, \alpha_2, \ldots, \alpha_n$ are the roots of the polynomial
$p(z) = z^n + a_1 z^{n-1} + \ldots + a_n$, and each root is taken the number of times
corresponding to its multiplicity, then the following equalities are valid:

$$\begin{aligned}
\alpha_1 + \alpha_2 + \cdots + \alpha_n &= -a_1, \\
\alpha_1\alpha_2 + \alpha_1\alpha_3 + \cdots + \alpha_1\alpha_n + \alpha_2\alpha_3 + \cdots + \alpha_{n-1}\alpha_n &= a_2, \\
\alpha_1\alpha_2\alpha_3 + \alpha_1\alpha_2\alpha_4 + \cdots + \alpha_{n-2}\alpha_{n-1}\alpha_n &= -a_3, \\
&\;\;\vdots \\
\alpha_1\alpha_2 \ldots \alpha_{n-1} + \alpha_1\alpha_2 \ldots \alpha_{n-2}\alpha_n + \cdots + \alpha_2\alpha_3 \ldots \alpha_n &= (-1)^{n-1} a_{n-1}, \\
\alpha_1\alpha_2 \ldots \alpha_n &= (-1)^n a_n.
\end{aligned}$$

Prove the validity of the Viète formulae.

¹¹ Joseph-Louis Lagrange (1736-1813), French mathematician, mechanic and astronomer.

¹² François Viète, seigneur de la Bigotière (1540-1603), French mathematician.

4.32. Let $z_1$, $z_2$ be the roots of the quadratic trinomial $p(z) = z^2 + uz + v$. Express
the following values by the coefficients u and v:

(1) $z_1^2 + z_2^2$;
(2) $\dfrac{1}{z_1^2} + \dfrac{1}{z_2^2}$.

4.33. Let $z_1$, $z_2$ be the roots of the quadratic trinomial $p(z) = z^2 + uz + v$. Express
the following values by the coefficients u and v:

(1) $z_1^4 + z_2^4$;
(2) $z_1^{-4} + z_2^{-4}$.

4.34. Find the sum and the product of all roots for each equation:
(1) $z^3 + 3z^2 + 3z + 1 = 0$;
(2) $z^4 + 10z^2 + 20 = 0$.

4.35. Find the sum and the product of all roots for each equation: 
(1) 21 — 10027 + 28 = 0; 
(2) P+24+1=0. 


*4.36. Compute the determinant

$$\Delta = \begin{vmatrix} z_1 & z_2 & z_3 \\ z_3 & z_1 & z_2 \\ z_2 & z_3 & z_1 \end{vmatrix},$$

where $z_1$, $z_2$ and $z_3$ are the roots of the cubic equation $z^3 + \alpha z + \beta = 0$ with
complex coefficients $\alpha, \beta \in \mathbb{C}$.
*4.37. Solve the equation $z^3 - 3z + 2 = 0$, using the Cardano formula.
*4.38. Solve the equation $2z^3 - 13z^2 - 17z + 70 = 0$, using the Cardano formula.
4.39. Solve the systems of linear equations using Cramer’s rule: 


(1+ i)x, + (-24+ 2i)x2 + 1 +i)x3 = 1—i, 
(a) —x1 + 3 —i)x2 + (2+ i)x3 = —7 + 2i, 
(—2 —1)x, + (-1 —i)x3 = 3 — Si; 


(—2 + 2i)x, + (2+ i)x2 + (24+ 21)x3 = Ti, 
(b) (—1 4+ 2i)x, + (—2 — i)x2 + 2ix3 = —8 +4 2i, 
(2 + 2i)x1 + 2ixe + UI —i)x3 = 244i; 
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4.40. 


+*4.41. 


(-1+i)x3 = 1-33, 
(c) (-1+i)x1 — x2 + (-1 —i)x3 = 3 — 2i, 
xy + (44+ i1)x24+ 3 —-i)x3 = 44+ 81; 

(4+ i)x, —ix2 + 3x3 = —i, 


(d) 3x, + 4x2 + I — i)x3 = —44 91, 
ixy + 3x2 +%x3=-5+ 61. 


Solve the systems of linear equations using Gaussian elimination: 
(2+ 1)x1 + 1 —1)x2+ 1 — 1x3 = 81, 
(a) (1 —i)x, + (-1+i)x2 + ix3 = —2, 
—ix, + G+ i)x24+ (-1 -—1)x3 = 844i; 


(4—i)x1 + 4—i)x2 — ix3 = 2+ 9i, 
(b) 5x1 + 3x2 —ix3 = 5+ 10i, 
2x, + (2+ i1)x2+ U1 +1)x3 = 345i; 


(1 + 2i)x; + (3 + 2i)x2 + (—-2 + 21)x3 = 11 +i, 
(c) (4—i)x, + (-1 4+ 2i)x2 — x3 = 84+ 31, 
(+ i)x; + 2ixg = 44 81; 


5x, + (—2 —i)x2 + (-3 — 1)x3 = —12-3, 
(d) ixy + (1+ 2i)x2 + (2 — i)x3 = —3 +i, 
2ix2+x3 = —4+i. 


Solve the systems of linear equations relative to five variables using Gaussian 
elimination: 


(4 — i)xy tixg + (—1 4+ i)x3 + (44 2i)xg + (—3 + i)x5 = —5 — 91, 
2x1 — x2 — 3x3 + (2—i)x4 + (—2—i)x5 = 3 — 5i, 
(a) (1 —i)xy + (1 —a)x2 + (2 — 1x3 + (—2 — i)xg + 3x5 = 104 11, 
(1 + 2i)x, — 2xp + (24+ 21)x3 — 3x4 + (—2 4+ 1)x5 = —2, 
(—3 + 2i)x, + xp + (—2 + i)x3 — ixg + 2ixs = —11 + 3i; 
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3x, + (—3 — i)xo + (2 —1)x3 + (—2 +: i)x4 +ixs = —2 -i, 
xy + ixe + (-3 + 2i)x3 + 1+ i)x4 4+ (44 21)x5 = 94 123, 
(b) 2ixy + (2 — i)xz + 3x3 + 2x4 + (—2 —i)x5 = —9 — 5i, 
(—3 — i)x1 + (—3 — i)x2 + 1x3 + 1 + i)x4 + (—1 + i)xs = 4, 
(2+ i)x,; + 2x2. + 1 +i)x3+( + 21)x5 = —3 4+ 10i. 
4.42. Which of the following matrices are Hermitian? 


1 2+ 10: 
(1) 
2 — 107 
a) ~ 
2+ 2i 2i 
—2i —2+4+2i 
10 2 
: 03 0 4]; 
) aR : 
20-i 
11 11 
1 i -l-i 
(6) 
1-1 1-1 
1-i-1 i 


4.43. What condition is imposed on the diagonal elements of the Hermitian 
matrix? 
4.44. Show that if Z; and Z2 are complex matrices of the same order, then the 


1 
matrix =(Z,Z2 + ZZ) is Hermitian. 
4.45. Anti-Hermitian or skew-Hermitian matrix is the matrix A, for which the 


relation A” = —A is fulfilled. In other words, Hermitian conjugate of such a 
matrix results in multiplying all its elements by (— 1). Which of the following 
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4.46. 


4.47. 


matrices are anti-Hermitian? 


1 O 2+i 
dq) = 
V2 S049 7 
i 
(2) : 
3 -2i 
1);2+i 2i 
(3) 5 ; 
Bo). Oy Hoey 
: 15 0 
4) =| —5 2i —5 }; 
4) = 5 2i —5 
0 5 3i 
; —2i1 1 0 
5) =} -1 0-3]; 
O) 5 10-3 
0 3 2i 
iid 
it -i-l 
(6) 


i-i i -i 
i-i-i i 
What condition is imposed on the diagonal elements of an anti-Hermitian 


matrix? 
Which of the following matrices are unitary? 


123 
(1) : 
321 
(2) 1 1 -i 
a2 -i l ; 
1);2+i 2 
(3) 
3 | 22% —2+i 
(4) 1 3 -i 
V10] -i 3 
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4.48. 


4.55. 
4.56. 
4.57. 


4.58. 


. Prove the identity det(e 
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10 0 
(6) loc 0 |: 
J/10 
0 0 ei 
1111 
ee a 
21 it =f 1 
a 


4.48. What condition must the complex numbers $u_1$ and $u_2$ and the real
number $\varphi$ satisfy in order for the matrix
$\begin{bmatrix} u_1 & u_2 \\ -e^{i\varphi} u_2^* & e^{i\varphi} u_1^* \end{bmatrix}$ to be unitary?

4.49. At the examination in linear algebra a student says that for any square
matrix A the equality $\ln(\exp A) = A$ is valid. Is the student right?

4.50. Prove the identity $\det(e^A) = e^{\operatorname{tr} A}$, valid for an arbitrary square matrix A
with complex coefficients.

4.51. Prove Theorem 4.4 (see page 182): the determinant of a unitary matrix is
a complex number whose modulus is equal to one.

4.52. Compute the commutators of the Pauli matrices $[\sigma_1, \sigma_2]$, $[\sigma_2, \sigma_3]$ and
$[\sigma_3, \sigma_1]$.

4.53. Compute the product of the Pauli matrices $\sigma_1\sigma_2\sigma_3$.

4.54. Prove the generalization of Euler’s identity for the Pauli matrices $\sigma_1$, $\sigma_2$ and $\sigma_3$:

$$\exp(i\sigma_k \varphi) = I\cos\varphi + i\sigma_k \sin\varphi \quad \text{for } k = 1, 2, 3, \tag{4.49}$$

where $I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$ is the identity matrix of the second order.

4.55. What is the square of the Dirac matrix $\beta$?

4.56. Compute the product of the Dirac matrices $\alpha_1\alpha_2\alpha_3\beta$.

4.57. Compute the result of the actions of the quantum circuit

X H S H

on a qubit that is originally in the state $|0\rangle$.

4.58. Compute the result of the actions of the quantum circuit

Y HH Z T

on a qubit that is originally in the state $|1\rangle$.

4.59. Compute the result of the actions of the quantum circuit

T H S H S

on a qubit that is originally in the state $|\psi\rangle = u|0\rangle + v|1\rangle$, where $u, v \in \mathbb{C}$.

4.60. Show that the quantum circuit

[quantum circuit diagram]

transfers the state $|\psi_1\psi_2\rangle$ to the state $|\psi_2\psi_1\rangle$.
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4.1 Solution. 


(1) Operations with complex numbers should be performed similarly to the respective
operations with algebraic polynomials, using the property $i^2 = -1$:

$$(3 + i)(2 + 5i) = 3 \cdot 2 + 3 \cdot (5i) + i \cdot 2 + i \cdot (5i)
= 6 + 15i + 2i + 5i^2 = 6 + 17i + 5(-1) = 1 + 17i;$$

(2) $(3 - i)(3 + i) = 3^2 - i^2 = 10$;

(3) $(1 - i)^4 + (1 + i)^4 = (1 - 4i + 6i^2 - 4i^3 + i^4) + (1 + 4i + 6i^2 + 4i^3 + i^4)
= 2(1 + 6i^2 + (i^2)^2) = 2(1 + 6(-1) + (-1)^2) = -8$;

M227 297-7 S247. 


4.2 Answer: 


(1) —14 3i; 
(2) 29 + 147; 
(GQ) S34 255 
(4) 27. 
4.3 Solution.
We perform algebraic transformations, taking into account that $i^2 = -1$:
$z_1 z_3 - z_2^2 = (5 + i)(-1 + 3i) - (4 - i)^2 = 5 \cdot (-1) + 5 \cdot (3i) + i \cdot (-1) + i \cdot (3i) - (16 - 8i + i^2) = -8 + 14i - 15 + 8i = -23 + 22i$.
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4.4 Solution. 

We perform algebraic manipulations: z;z2 = | + 31, 2 = 1, a = -11+4+2i, 
a - ro = 11 -—i. As aresult z1z2(z3 _ a) = 14+ 32i. 
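4.5 Solution.

(1) We perform algebraic transformations:

$$(1 + 4i)^3 + (1 - 4i)^3
= (1 + 4i + 1 - 4i)\bigl((1 + 4i)^2 - (1 + 4i)(1 - 4i) + (1 - 4i)^2\bigr)
= 2(1 + 8i - 16 - 17 + 1 - 8i - 16) = -47 \cdot 2 = -94.$$

(2) Use the Newton¹³ binomial formula:

$$(a + b)^n = \sum_{k=0}^{n} C(n, k)\, a^{n-k} b^k,$$

valid for all $a, b \in \mathbb{C}$ and natural n:

$$(6 - i)^4 + (6 + i)^4 = 6^4 - 4 \cdot 6^3 \cdot i + 6 \cdot 6^2 \cdot (-1) - 4 \cdot 6i \cdot (-1) + (-1) \cdot (-1)
+ 6^4 + 4 \cdot 6^3 \cdot i + 6 \cdot 6^2 \cdot (-1) + 4 \cdot 6i \cdot (-1) + (-1) \cdot (-1) = 2162.$$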

4.6 Solution.

Denote $z_1 = a + ib$, $z_2 = c + id$. A fraction of the form $\dfrac{z_1}{z_2}$, where $z_1, z_2 \in \mathbb{C}$,
can be conveniently transformed by multiplying it by $1 = \dfrac{z_2^*}{z_2^*}$:

$$\frac{z_1}{z_2} = \frac{z_1}{z_2} \cdot \frac{z_2^*}{z_2^*} = \frac{z_1 z_2^*}{z_2 z_2^*} = \frac{z_1 z_2^*}{|z_2|^2}.$$

This is why the result of the division of two complex numbers $z_1/z_2$, where
$z_2 \neq 0$, will be the number

$$\frac{z_1}{z_2} = \frac{ac + bd}{c^2 + d^2} + i\,\frac{bc - ad}{c^2 + d^2}.$$

¹³ Isaac Newton (1643-1727), English mathematician, physicist, mechanic and astronomer.


Answers and Solutions 203 


4.7 Solution. 
Using the hint offered in the previous problem (i. e. z1/z2 = (z123)/(z2z5) = 
(z123)/lz2|7), we obtain 


34+i G+iBti) 1 


1 —(4+4 33); 
O— a= 5(4+3i) 
143%. G43 4 
2) = ~*~ = (-44 33); 
=e ee 
@) 24i7 2-1 | D cua 
RG fae 10°. 


i i id-—i)+id+i) . 
fe ici Ganda" ~ 
4.8 Solution. 

Find z2 and z3: z2 = 2 —i, z3 = (2+1)+ (2—i) = 4. After this, we obtain 
(21-23) (22-23) /z2 = (2+i—4) (2-1-4) / (2-1) = —(2—i)(—2—-1) /(2—i) = 2+. 
4.9 Solution.

Find $z_2$ and $z_3$: $z_2 = -7 - 2i$, $z_3 = -5 + 2i$.

Then $z_1 z_2 = -45 - 28i$, $z_2 z_3 = 39 - 4i$, $z_1 z_3 = -39 + 4i$, $z_1 z_2 + z_2 z_3 + z_1 z_3 = -45 - 28i$.

The final answer is $(z_1 z_2 + z_2 z_3 + z_1 z_3)/z_1 = -7 - 2i$.


4.10 Answer: 


(4) 


(1) z= al 3i); 
(2) z= 52-9). 


4.11 Solution.
Let $z = a + ib$, then $z - 3z^* = (a + ib) - 3(a - ib) = -2a + 4ib$. Since complex
numbers are equal if and only if their real and imaginary parts are equal, we obtain

$$\begin{cases} -2a = 18, \\ 4b = 4 \end{cases} \Rightarrow \begin{cases} a = -9, \\ b = 1, \end{cases}$$

whence $z = -9 + i$.

4.12 Answer: $z = -\dfrac{5}{2} + i$.


4.13 Proof.
Let $z_1 = x_1 + iy_1$, $z_2 = x_2 + iy_2$, where $x_1, x_2, y_1, y_2 \in \mathbb{R}$.

(1) Express the left side of the equality by $x_1$, $x_2$, $y_1$ and $y_2$:

$$(z_1 + z_2)^* = [(x_1 + x_2) + i(y_1 + y_2)]^* = (x_1 + x_2) - i(y_1 + y_2).$$

Now transform its right side:

$$z_1^* + z_2^* = (x_1 - iy_1) + (x_2 - iy_2) = (x_1 + x_2) - i(y_1 + y_2).$$

Therefore $\forall z_1, z_2 \in \mathbb{C}$ $(z_1 + z_2)^* = z_1^* + z_2^*$.

(2) The left side of the equality is

$$(z_1 z_2)^* = [(x_1 + iy_1)(x_2 + iy_2)]^* = (x_1 x_2 - y_1 y_2) - i(x_1 y_2 + x_2 y_1).$$

The right side coincides with the left one:

$$z_1^* z_2^* = (x_1 - iy_1)(x_2 - iy_2) = (x_1 x_2 - y_1 y_2) - i(x_1 y_2 + x_2 y_1) = (z_1 z_2)^*.$$

4.14 Proof.

Take the number with the modulus equal to one in exponential form: $z = e^{i\varphi}$.
After algebraic transformations

$$z^{-1} = (e^{i\varphi})^{-1} = e^{-i\varphi} = (e^{i\varphi})^* = z^*$$

we obtain the equality $z^{-1} = z^*$.


4.15 Hint. 

Use the geometric interpretation of the numbers z; and z2. The length of a side 
of an arbitrary triangle is no greater than the sum of the lengths of the two other 
sides, and is no less than the absolute value of their difference. 


4.16 Solution. 
Use the formula for the geometric progression sum: 


“id n+1 
q 74 
k=l q 
a veel oD Gta D5. ans 
fmt al f= - G=he =f 
M4 1 99 este 
(2) do i = 5 Yi ; 
k=—49 k=—99 


In the last sum, replace the summation index k’ = k + 49. Then the sought sum 
takes the following form: 


49 98 .99 9 
7 wo eag_jl? -— 1 , b=] 
kar? ik =i * clan me 
i-1 i-1 
k=—49 k’=0 
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4.17 Solution. 
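The imaginary unit has the property $i^4 = 1$. Consider four cases depending on
the remainder of the division of m by 4:

(1) $m = 4k$, $k \in \mathbb{Z}$:

$$i^m = i^{4k} = (i^4)^k = 1^k = 1;$$

(2) $m = 4k + 1$, $k \in \mathbb{Z}$:

$$i^m = i^{4k+1} = (i^4)^k \cdot i = i;$$

(3) $m = 4k + 2$, $k \in \mathbb{Z}$:

$$i^m = i^{4k+2} = (i^4)^k \cdot i^2 = -1;$$

(4) $m = 4k + 3$, $k \in \mathbb{Z}$:

$$i^m = i^{4k+3} = (i^4)^k \cdot i^3 = -i.$$

Thus, we finally obtain

$$i^m = \begin{cases}
1, & \text{if } m = 4k, \ k \in \mathbb{Z}; \\
i, & \text{if } m = 4k + 1, \ k \in \mathbb{Z}; \\
-1, & \text{if } m = 4k + 2, \ k \in \mathbb{Z}; \\
-i, & \text{if } m = 4k + 3, \ k \in \mathbb{Z}.
\end{cases}$$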
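
The four cases can be written as a short function that depends only on the remainder of m modulo 4. The name `power_of_i` is hypothetical; note that in Python the operator `%` returns a non-negative remainder for negative m as well, so negative exponents are covered.

```python
def power_of_i(m: int) -> complex:
    """Value of i**m for an integer m, chosen by the remainder m mod 4."""
    return (1, 1j, -1, -1j)[m % 4]
```

For instance, `power_of_i(-1)` gives `-1j`, in agreement with 1/i = -i.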


4.18 Solution. 


(1) For transition to a trigonometric form of the complex number, we must 
determine the modulus p = |z| and the argument g = arg z. Using the formulae 
for p and g, we obtain 


p= [x24 y2 = /22 4 02 = 2, 
g = arctan ~ + 20k = 0+ 2xk = 2k, keZ, 
x 


this is why the trigonometric form of the number 2 has the form 


2(cos(27k) +i sin(27k)), where k € Z. 
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(2) p = V¥0+32 = 3,9 = =sen(3) +2nk = > + 2zk, therefore 3i = 
3 (cos (= + 2k) +isin (= + 2nk)), keZ. 
2 2 - 
(3) p=V16+9=5, o = arctan (; + 2m), 
3 3 
44+3i=5 (cos (arctan Fi + 2mk) +isin (arctan zi + 2ak)), keZ. 


Gi geiee “sgn(—1) ca a + 4 Ink: 
—i = cos(a(2k — 1/2)) +i sin(a(2k — 1/2)),k € Z. 
(5) p = /9 +36 = 305, o = arctan2 — 7 + 2rk, 
-3-6i= 3/5 (cos(arctan 2 + (2k — 1)) +i sin(arctan2 + 27(2k — 1))), 
keZ. 
(6) p=2,9=7/4+4 27k, 
J/2 + J/2i = 2(cos(r/4 + 2k) +i sin(r/4 + 2k)), k € Z. 
(7) p= 30, gy = arctan(—3) + 2 + 27k, 
—J/3+3/3i = /30(cos(x — arctan 3+ 27k)) +i sin(w — arctan 3+ 2zk)), 
keZ. 
(8) Multiply the numerator and the denominator of the fraction by the value (9 +7) 
and transform the obtained expression: 


9+i O+O+i) 804181 40 9. 


—--—l, 


9-i (9-i)9-i) 82 °&41 41 
40\2 9\2 9 
p= (=) +(=) =e Sacre bank We finally obtain 
9+i 9 9 
9 ~ = cos (20k — arctan a) +i sin (27k — arctan ae k eZ. 
4.19 Solution. 
(1) 7 + isin ba 
cos — sin — = ~ + —i; 
ie a ee 
(2) 4( cos (-) +isin(->)) = —4i; 
V3 v6. V6 
(3) ———_—— = - = 1 - eS; 
3x, , 3B 2 2 
cos — +1 sin — 
4 4 
2 2 2-2 
(4) cos (— 2) +i sin(—2) = ce ane) 
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4.20 Proof. 
Turn to the exponential form of the number u. Let $a + ib = \rho e^{i\varphi}$, then
$a - ib = \rho e^{-i\varphi}$ and

$$\frac{a + ib}{a - ib} = \frac{\rho e^{i\varphi}}{\rho e^{-i\varphi}} = e^{i\varphi} \cdot e^{i\varphi} = e^{2i\varphi}.$$

Therefore $u = e^{i\delta}$, where $\delta = 2\varphi$.
4.21 Solution.
The exponential notation of the imaginary unit has the form $i = e^{i(\pi/2 + 2\pi k)}$,
where $k \in \mathbb{Z}$. Having written the imaginary unit in the base in exponential form and
using the identity $(e^a)^b = e^{ab}$, we obtain

$$i^i = \left(e^{i(\pi/2 + 2\pi k)}\right)^i = e^{i \cdot i(\pi/2 + 2\pi k)} = e^{-\pi/2 + 2\pi k'}, \quad \text{where } k, k' \in \mathbb{Z}.$$

As is seen from the considered example, raising to a complex power is a multivalued
function (multifunction) on the set of complex numbers $\mathbb{C}$.
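
The values of $i^i$ can be inspected numerically. In the sketch below the function name `i_to_the_i` is hypothetical, and the branch is parametrized as $e^{-(\pi/2 + 2\pi k)}$; the built-in complex power of Python returns the value for k = 0.

```python
import math

def i_to_the_i(k: int = 0) -> float:
    """Value e^{-(pi/2 + 2*pi*k)} of i**i for the integer branch number k."""
    return math.exp(-(math.pi / 2 + 2 * math.pi * k))

# Python's complex power picks the branch with k = 0.
principal = (1j) ** (1j)
```

All values of $i^i$ are real and positive; they form a geometric sequence with the ratio $e^{2\pi}$.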
4.22 Answer:

(1) $\sqrt{8i} = \pm 2(1 + i)$;
(2) $\sqrt[6]{4096} \in \{\pm 4,\ \pm 2(1 \pm i\sqrt{3})\}$.


4.23 Proof.

(1) Transform the exponential notation of the number $\omega_{k+n/2}$:

$$\omega_{k+n/2} = e^{2\pi i (k + n/2)/n} = e^{2\pi i k/n}\, e^{\pi i} = \omega_k e^{\pi i}.$$

Using the equality $e^{\pi i} = -1$, we obtain $\omega_{k+n/2} = -\omega_k$ for even n and
$k = 0, 1, \ldots, n/2 - 1$.

(2) The values $\omega_k$ form a geometric progression whose common ratio is
$\omega_1 = e^{2\pi i/n}$. Using the formula for the geometric progression sum
$1 + q + q^2 + \cdots + q^{n-1} = \dfrac{q^n - 1}{q - 1}$ for $q \neq 1$, we obtain

$$\sum_{k=0}^{n-1} \omega_k = \sum_{k=0}^{n-1} e^{2\pi i k/n} = \frac{(e^{2\pi i/n})^n - 1}{e^{2\pi i/n} - 1} = 0.$$

(3) Write the product in exponential form:

$$\prod_{k=0}^{n-1} \omega_k = \prod_{k=0}^{n-1} e^{2\pi i k/n} = e^{\left(2\pi i \sum_{k=0}^{n-1} k\right)/n}.$$

The sum in the exponent is $\sum_{k=0}^{n-1} k = \dfrac{n(n - 1)}{2}$, and therefore

$$\prod_{k=0}^{n-1} \omega_k = e^{\pi i (n-1)} = \cos \pi(n - 1) + i \sin \pi(n - 1) = (-1)^{n-1}.$$
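
Identities (2) and (3) of Problem 4.23 can be checked numerically for small n. The sketch below is only a floating-point check with a hypothetical helper `roots_of_unity`; equalities hold up to rounding errors.

```python
import cmath

def roots_of_unity(n: int):
    """The n-th roots of unity w_k = exp(2*pi*i*k/n), k = 0, ..., n-1."""
    return [cmath.exp(2j * cmath.pi * k / n) for k in range(n)]

def product(values):
    """Product of a sequence of complex numbers."""
    result = 1
    for z in values:
        result *= z
    return result

for n in range(2, 10):
    w = roots_of_unity(n)
    assert abs(sum(w)) < 1e-12                          # identity (2), n > 1
    assert abs(product(w) - (-1) ** (n - 1)) < 1e-12    # identity (3)
```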


4.26 Answer: $\dfrac{n}{x^n - 1}$.


4.27 Proof.
Consider the sum $Z = \sum_{k=1}^{n} e^{ik\alpha}$. It is easy to see that the following relations are
valid:

$$\cos\alpha + \cos 2\alpha + \cdots + \cos n\alpha = \operatorname{Re} Z,$$

$$\sin\alpha + \sin 2\alpha + \cdots + \sin n\alpha = \operatorname{Im} Z.$$

Calculate Z, using the formula for the geometric progression sum:

$$Z = \sum_{k=1}^{n} e^{i\alpha k} = \frac{e^{i\alpha(n+1)} - e^{i\alpha}}{e^{i\alpha} - 1}.$$

Simplify the obtained expression, multiplying the fraction by $1 = \dfrac{e^{-i\alpha/2}}{e^{-i\alpha/2}}$ and
performing simple transformations:

$$Z = \frac{e^{i\alpha(n+1)} - e^{i\alpha}}{e^{i\alpha} - 1} \cdot \frac{e^{-i\alpha/2}}{e^{-i\alpha/2}}
= \frac{e^{i\alpha(n+1/2)} - e^{i\alpha/2}}{e^{i\alpha/2} - e^{-i\alpha/2}}.$$

The denominator of the obtained fraction is $e^{i\alpha/2} - e^{-i\alpha/2} = 2i \sin(\alpha/2)$. Rewrite the
exponents in the numerator, using Euler’s formula:

$$Z = \frac{\cos[(n + 1/2)\alpha] + i \sin[(n + 1/2)\alpha] - (\cos(\alpha/2) + i \sin(\alpha/2))}{2i \sin(\alpha/2)}
= \frac{\sin[(n + 1/2)\alpha] - \sin(\alpha/2)}{2 \sin(\alpha/2)} + \frac{\cos[(n + 1/2)\alpha] - \cos(\alpha/2)}{2i \sin(\alpha/2)}.$$

Further, use the known trigonometric formulae (see Appendix B “Trigonometric
Formulae”, formulae (B.16) and (B.18))

$$\sin a - \sin b = 2 \sin\frac{a - b}{2} \cos\frac{a + b}{2},$$

$$\cos a - \cos b = -2 \sin\frac{a - b}{2} \sin\frac{a + b}{2}.$$

We obtain

$$Z = \frac{\sin(n\alpha/2)}{\sin(\alpha/2)} \cos[(n + 1)\alpha/2] + i\,\frac{\sin(n\alpha/2)}{\sin(\alpha/2)} \sin[(n + 1)\alpha/2],$$

whence the Lagrange identities follow directly.
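
A quick numerical comparison of both sides of the Lagrange identities helps to catch sign or index mistakes in such derivations. The function names below are hypothetical, and the comparison is made in floating-point arithmetic for one sample pair of n and α.

```python
import math

def lagrange_cos_sum(n: int, alpha: float) -> float:
    """Closed form of cos(alpha) + cos(2 alpha) + ... + cos(n alpha)."""
    return math.sin(n * alpha / 2) / math.sin(alpha / 2) * math.cos((n + 1) * alpha / 2)

def lagrange_sin_sum(n: int, alpha: float) -> float:
    """Closed form of sin(alpha) + sin(2 alpha) + ... + sin(n alpha)."""
    return math.sin(n * alpha / 2) / math.sin(alpha / 2) * math.sin((n + 1) * alpha / 2)

n, alpha = 7, 0.3   # alpha must not be a multiple of 2*pi
direct_cos = sum(math.cos(k * alpha) for k in range(1, n + 1))
direct_sin = sum(math.sin(k * alpha) for k in range(1, n + 1))
```

The values `direct_cos` and `direct_sin` agree with the closed forms up to rounding errors.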


4.28 Proof.

Denote the predicate “$(\cos\varphi + i \sin\varphi)^n = \cos n\varphi + i \sin n\varphi$” by P(n) and prove
the statement $\forall n\ P(n)$ by the mathematical induction method.

Basis step

For n = 1 we obtain the valid identity $(\cos\varphi + i \sin\varphi)^1 = \cos\varphi + i \sin\varphi$,
therefore P(1) is true.

Inductive step

Suppose that P(k), $k \in \mathbb{N}$, is true. Prove the truth of the proposition P(k + 1).
We need to prove that

$$(\cos\varphi + i \sin\varphi)^{k+1} = \cos(k + 1)\varphi + i \sin(k + 1)\varphi.$$

Consider the expression $(\cos\varphi + i \sin\varphi)^{k+1}$ and represent it in the form

$$(\cos\varphi + i \sin\varphi)^{k+1} = (\cos\varphi + i \sin\varphi)^k \cdot (\cos\varphi + i \sin\varphi).$$

According to the inductive supposition, the first factor is
$(\cos\varphi + i \sin\varphi)^k = \cos k\varphi + i \sin k\varphi$.

Then

$$(\cos\varphi + i \sin\varphi)^{k+1} = (\cos k\varphi + i \sin k\varphi) \cdot (\cos\varphi + i \sin\varphi).$$

Open the brackets in the obtained expression, using the known identities for
trigonometric functions, provided in Appendix B, formulae (B.11) and (B.9):

$$\cos(a + b) = \cos a \cos b - \sin a \sin b,$$

$$\sin(a + b) = \sin a \cos b + \cos a \sin b,$$

assuming $a = k\varphi$, $b = \varphi$. We obtain

$$(\cos\varphi + i \sin\varphi)^{k+1}
= \underbrace{(\cos k\varphi \cos\varphi - \sin k\varphi \sin\varphi)}_{\cos(k+1)\varphi}
+ i \underbrace{(\sin k\varphi \cos\varphi + \cos k\varphi \sin\varphi)}_{\sin(k+1)\varphi}.$$

Hence, according to the mathematical induction principle, de Moivre’s formula

$$(\cos\varphi + i \sin\varphi)^n = \cos n\varphi + i \sin n\varphi$$

is valid for all natural values $n \in \mathbb{N}$.


4.29 Solution. 

Consider a more general case of the problem statement and express $\cos n\varphi$ and $\sin n\varphi$ in terms of the cosine and sine of the angle $\varphi$.

For this, note that the left side of de Moivre's formula (4.1) contains an expression that can be expanded by the Newton binomial formula (see page 192). Thus, represent the left side in the form:

\[ (\cos\varphi + i\sin\varphi)^n = \sum_{j=0}^{n} C(n,j)\,(\cos^{n-j}\varphi)(i\sin\varphi)^j = \sum_{j=0}^{n} i^j\,C(n,j)\cos^{n-j}\varphi\,\sin^{j}\varphi. \]

It is convenient to partition the sum into two sums, over even ($j = 2k$) and odd ($j = 2k+1$) values of $j$, and introduce a new summation variable $k = 0, 1, 2, \ldots$:

\[ (\cos\varphi + i\sin\varphi)^n = \sum_{k=0}^{\lfloor n/2\rfloor} i^{2k}\,C(n,2k)\cos^{n-2k}\varphi\,\sin^{2k}\varphi + \sum_{k=0}^{\lfloor (n-1)/2\rfloor} i^{2k+1}\,C(n,2k+1)\cos^{n-2k-1}\varphi\,\sin^{2k+1}\varphi \]
\[ = \sum_{k=0}^{\lfloor n/2\rfloor} (-1)^{k}\,C(n,2k)\cos^{n-2k}\varphi\,\sin^{2k}\varphi + i\sum_{k=0}^{\lfloor (n-1)/2\rfloor} (-1)^{k}\,C(n,2k+1)\cos^{n-2k-1}\varphi\,\sin^{2k+1}\varphi. \]

Now we only have to take advantage of the fact that $\cos n\varphi = \operatorname{Re}(\cos\varphi + i\sin\varphi)^n$ and $\sin n\varphi = \operatorname{Im}(\cos\varphi + i\sin\varphi)^n$. We obtain formulae for the cosine and sine of a multiple argument:

\[ \cos n\varphi = \sum_{k=0}^{\lfloor n/2\rfloor} (-1)^{k}\,C(n,2k)\cos^{n-2k}\varphi\,\sin^{2k}\varphi, \]
\[ \sin n\varphi = \sum_{k=0}^{\lfloor (n-1)/2\rfloor} (-1)^{k}\,C(n,2k+1)\cos^{n-2k-1}\varphi\,\sin^{2k+1}\varphi. \]

For $n = 3$ the obtained formulae take the form

\[ \cos 3\varphi = \sum_{k=0}^{1} (-1)^{k}\,C(3,2k)\cos^{3-2k}\varphi\,\sin^{2k}\varphi = \cos^3\varphi - 3\cos\varphi\,\sin^2\varphi, \]
\[ \sin 3\varphi = \sum_{k=0}^{1} (-1)^{k}\,C(3,2k+1)\cos^{2-2k}\varphi\,\sin^{2k+1}\varphi = 3\cos^2\varphi\,\sin\varphi - \sin^3\varphi. \]


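4.30 Answer:

\[ \cos 4\varphi = \sum_{k=0}^{2} (-1)^{k}\,C(4,2k)\cos^{4-2k}\varphi\,\sin^{2k}\varphi = \cos^4\varphi - 6\cos^2\varphi\,\sin^2\varphi + \sin^4\varphi, \]
\[ \sin 4\varphi = \sum_{k=0}^{1} (-1)^{k}\,C(4,2k+1)\cos^{3-2k}\varphi\,\sin^{2k+1}\varphi = 4\cos^3\varphi\,\sin\varphi - 4\cos\varphi\,\sin^3\varphi. \]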
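
The multiple-argument formulae of Problems 4.29 and 4.30 can be evaluated directly. The sketch below assumes $n \ge 1$; the function name `cos_sin_multiple` is hypothetical and chosen only for illustration. It sums the two binomial series and lets one compare the result with the library values of $\cos n\varphi$ and $\sin n\varphi$.

```python
import math


def cos_sin_multiple(n: int, phi: float) -> tuple[float, float]:
    """Evaluate cos(n*phi) and sin(n*phi) from powers of cos(phi), sin(phi)."""
    c, s = math.cos(phi), math.sin(phi)
    # Even powers of i*sin(phi) contribute to the real part.
    cos_n = sum((-1) ** k * math.comb(n, 2 * k) * c ** (n - 2 * k) * s ** (2 * k)
                for k in range(n // 2 + 1))
    # Odd powers contribute to the imaginary part.
    sin_n = sum((-1) ** k * math.comb(n, 2 * k + 1)
                * c ** (n - 2 * k - 1) * s ** (2 * k + 1)
                for k in range((n - 1) // 2 + 1))
    return cos_n, sin_n


cos4, sin4 = cos_sin_multiple(4, 0.3)
assert abs(cos4 - math.cos(1.2)) < 1e-12
assert abs(sin4 - math.sin(1.2)) < 1e-12
```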
4.31 Hint. 


Multiply out the brackets on the right side of the factorization of the polynomial p(z) and
compare the obtained coefficients of like powers with the coefficients of p(z).


4.32 Solution. 


(1) Represent $z_1^2 + z_2^2$ in the form

\[ z_1^2 + z_2^2 = (z_1 + z_2)^2 - 2z_1z_2 \]

and express the sum and the product of the roots of $p(z)$ by Viète formulae, proved in the previous problem:

\[ z_1^2 + z_2^2 = (-u)^2 - 2v = u^2 - 2v. \]

(2) \[ \frac{1}{z_1^2} + \frac{1}{z_2^2} = \frac{z_1^2 + z_2^2}{z_1^2 z_2^2} = \frac{u^2 - 2v}{v^2}. \]
4.33 Answer: 


(1) $z_1^4 + z_2^4 = (u^2 - 2v)^2 - 2v^2$;

(2) $\dfrac{1}{z_1^4} + \dfrac{1}{z_2^4} = \dfrac{(u^2 - 2v)^2 - 2v^2}{v^4}$.
4.34 Answer: 
3 3 
QQ) SMu=-3, [[ w=; 
k=1 k=1 
4 4 
(2) So ze =0, [] ze = 20. 
k=1 k=1 
4.35 Answer: 
4 4 
(1) do zm = 100, [] ze =0; 
k=1 k=1 
5 5 
QQ) > #=-l, [-e==L 
k=1 k=1 
4.36 Solution. 


In the determinant, add to the first row the second and the third rows:

\[ \Delta = \begin{vmatrix} z_1+z_2+z_3 & z_1+z_2+z_3 & z_1+z_2+z_3 \\ z_3 & z_1 & z_2 \\ z_2 & z_3 & z_1 \end{vmatrix}. \]

According to Viète formulae (see Problem 4.31), the sum of all roots $z_1 + z_2 + z_3$ is equal to the coefficient of the quadratic term $z^2$, taken with reversed sign. For the equation $z^3 + \alpha z + \beta = 0$ we have $z_1 + z_2 + z_3 = 0$ and, therefore, in the determinant $\Delta$ the first row consists entirely of zero elements. It is clear that such a determinant is equal to zero.


4.37 Answer: $z_1 = -2$, $z_2 = z_3 = 1$.
5 
4.38 Answer: z} = 7, Z2 = > 73 = 2. 


4.39 Answer:

(a) $[x_1, x_2, x_3]^T = [2i, -2, i]^T$;
(b) $[x_1, x_2, x_3]^T = [i, 1, 2+2i]^T$;
(c) $[x_1, x_2, x_3]^T = [-3, 3, -2+i]^T$;
(d) $[x_1, x_2, x_3]^T = [1, -1+2i, -2-i]^T$.

4.40 Answer:

(a) $[x_1, x_2, x_3]^T = [2i, 1+i, -2+2i]^T$;
(b) $[x_1, x_2, x_3]^T = [3+2i, -3+i, 3-i]^T$;
(c) $[x_1, x_2, x_3]^T = [2, 3-i, -1+2i]^T$;
(d) $[x_1, x_2, x_3]^T = [-2+2i, 3i, 2+i]^T$.


4.41 Answer:

(a) $[x_1, x_2, x_3, x_4, x_5]^T = [2+i, i, -1+i, -2, 1+2i]^T$;
(b) [x1, x2, x3, x4, x5]? = li, —2, -1 ol i, 1 + Lo oi. 


4.42 Answer: matrices (1) and (2) are Hermitian.
4.43 Answer: the diagonal elements of a Hermitian matrix are real.
4.44 Proof. 
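Introduce the notation $W = \frac{1}{2}(Z_1Z_2 + Z_2Z_1)$ and find the Hermitian conjugate matrix relative to $W$:

\[ W^H = \left(\tfrac{1}{2}(Z_1Z_2 + Z_2Z_1)\right)^H = \tfrac{1}{2}\left((Z_1Z_2)^H + (Z_2Z_1)^H\right) = \tfrac{1}{2}(Z_2Z_1 + Z_1Z_2) = W. \]

It is proved that $W$ is a Hermitian matrix.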
4.45 Answer: anti-Hermitian are matrices (1), (5), (6). 


4.46 Answer: diagonal elements of anti-Hermitian matrix are purely imaginary 
values. 


4.47 Answer: unitary are matrices (2), (3), (4), (6). 
4.48 Answer: $|u_1|^2 + |u_2|^2 = 1$, $\varphi \in \mathbb{R}$ is any real number.


4.49 Solution. 
In this case, the student is wrong, since for $A = 2\pi i I$, where $I$ is the identity
matrix, we have $\exp(A) = e^{2\pi i} I = I$; therefore, $\ln(\exp(A)) = O \neq A$.


4.50 Hint. 

For the diagonal matrix, $\det(e^A) = e^{\lambda_1}e^{\lambda_2}\cdots e^{\lambda_n} = e^{\operatorname{tr} A}$. As for the non-diagonal
matrix, we either diagonalize it, if possible, or, with any predefined accuracy,
approximate it by a sequence of matrices, each being diagonalizable.


4.51 Proof. 
Let $U$ be an arbitrary unitary matrix. Using property (6) of Hermitian conjugate
on page 181, represent the modulus of the determinant of $U$ in the following form:

\[ |\det U| = \sqrt{(\det U)(\det U)^*} = \sqrt{(\det U)(\det U^H)}. \]

Since the product of the determinants of matrices is equal to the determinant of their
product, $\forall A, B\ (\det A \cdot \det B = \det(AB))$, then

\[ |\det U| = \sqrt{\det(UU^H)} = \sqrt{\det I} = 1. \]

Thus, the modulus of the determinant of the unitary matrix is equal to one.
4.52 Answer: $[\sigma_1, \sigma_2] = 2i\sigma_3$, $[\sigma_2, \sigma_3] = 2i\sigma_1$, $[\sigma_3, \sigma_1] = 2i\sigma_2$.

4.53 Answer: $\sigma_1\sigma_2\sigma_3 = iI$, where $I$ is the identity matrix of size 2 x 2.
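
The answers to Problems 4.52 and 4.53 are easy to confirm by direct multiplication. The sketch below assumes the standard form of the Pauli matrices; the helper names `matmul`, `commutator` and `scale` are illustrative and do not come from the text.

```python
def matmul(a, b):
    """Multiply two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator(a, b):
    """Return [a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


def scale(c, a):
    """Multiply every element of the matrix a by the number c."""
    return [[c * x for x in row] for row in a]


# Standard Pauli matrices (assumed form).
sigma1 = [[0, 1], [1, 0]]
sigma2 = [[0, -1j], [1j, 0]]
sigma3 = [[1, 0], [0, -1]]
identity = [[1, 0], [0, 1]]

assert commutator(sigma1, sigma2) == scale(2j, sigma3)
assert commutator(sigma2, sigma3) == scale(2j, sigma1)
assert commutator(sigma3, sigma1) == scale(2j, sigma2)
assert matmul(matmul(sigma1, sigma2), sigma3) == scale(1j, identity)
```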
4.55 Answer: Bp = I, where J is an identity matrix of size 4 x 4. 


4.56 Answer:

\[ \alpha_1\alpha_2\alpha_3\beta = \begin{bmatrix} 0 & 0 & -i & 0 \\ 0 & 0 & 0 & -i \\ i & 0 & 0 & 0 \\ 0 & i & 0 & 0 \end{bmatrix}. \]


4.57 Solution.
The state of the qubit $|0\rangle$ is described by the matrix $\begin{bmatrix} 1 \\ 0 \end{bmatrix}$. Taking into account
the matrix representation of the quantum elements H, S and X from Table 4.1, we
obtain


setae 1f11sf10} 1 }1 1 | for} >) 
—_> SS = = 

J2\1=1) |O¢ | 21011 VO) 16 i, 

2 


Thus, as a result of the quantum circuit’s action on a qubit in the state |w) = |0), 
it passes to the state 


1-i Dae 
|0) + 


y= 5 


1). 


4.58 Answer: as a result of the quantum circuit’s action on a qubit in the state |W) = 
|1), it passes to the state 
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4. i Answer: a i in the state |v) = u|0) + v|1), passes to the state |y’) = 


(5 (+ du+ 2) 10) + +(5 51 +i) — 50) i). 


Chapter 5
Vector Spaces


By n-dimensional arithmetic vector, we will mean an ordered set of n real 
numbers. 


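The numbers $x_i$ ($i = 1, 2, \ldots, n$) are called coordinates or components of the
vector $\mathbf{x}$. They are written either in the row: $\mathbf{x} = (x_1, x_2, \ldots, x_n)$, or in the column:

\[ \mathbf{x} = \begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{bmatrix}. \tag{5.1} \]

For designation of vectors and distinguishing them from scalar values, bold font
is used, for example, $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$, etc.¹

The vectors $\mathbf{x} = (x_1, x_2, \ldots, x_n)$ and $\mathbf{y} = (y_1, y_2, \ldots, y_n)$ are called equal
vectors, if the equalities $x_1 = y_1$, $x_2 = y_2$, ..., $x_n = y_n$ are valid.

Sum of the vectors $\mathbf{x}$ and $\mathbf{y}$ is the vector

\[ \mathbf{x} + \mathbf{y} = (x_1 + y_1, x_2 + y_2, \ldots, x_n + y_n). \tag{5.2} \]

Product of the number $\alpha$ and a vector $\mathbf{x}$ is the vector

\[ \alpha\mathbf{x} = (\alpha x_1, \alpha x_2, \ldots, \alpha x_n). \tag{5.3} \]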


¹ Another designation of vector is also used, when an arrow is placed above its symbol. For
example, using this method, the vectors $\mathbf{a}$ and $\mathbf{b}$ will be designated as $\vec{a}$ and $\vec{b}$.


© Springer Nature Switzerland AG 2021 217 
S. Kurgalin, S. Borzunov, Algebra and Geometry with Python, 
https://doi.org/10.1007/978-3-030-61541-3_5 


Difference of the vectors $\mathbf{x}$ and $\mathbf{y}$ is the vector

\[ \mathbf{x} - \mathbf{y} = \mathbf{x} + (-1)\mathbf{y} = (x_1 - y_1, x_2 - y_2, \ldots, x_n - y_n). \tag{5.4} \]

Zero vector, or null vector, is the vector $\mathbf{0} = (0, 0, \ldots, 0)$ that has zero
coordinates. It is obvious that $\mathbf{x} + \mathbf{0} = \mathbf{x} - \mathbf{0} = \mathbf{x}$.
For the vector $\mathbf{x} \in \mathbb{R}^n$, by $-\mathbf{x}$ denote the vector with the coordinates

\[ (-x_1, -x_2, \ldots, -x_n), \tag{5.5} \]

such a vector being referred to as opposite relative to $\mathbf{x}$.
From the introduced definitions, it follows that $-\mathbf{x} = (-1)\cdot\mathbf{x}$ and $\mathbf{x} + (-\mathbf{x}) = \mathbf{0}$.
A set of all arithmetic vectors with the operations of addition and multiplication
introduced on them is called an n-dimensional arithmetic space and denoted by
$\mathbb{R}^n$ [30].


Example 5.1 $\mathbb{R}^1$ is a one-dimensional space (line), $\mathbb{R}^2$ is a two-dimensional space
(plane) and $\mathbb{R}^3$ is a three-dimensional space.


5.1 Linear Dependence of Vectors in the Space 


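Consider a set of vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_k \in \mathbb{R}^n$ and the real numbers
$\alpha_1, \alpha_2, \ldots, \alpha_k \in \mathbb{R}$.

The vector $\mathbf{x} = \alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_k\cdot\mathbf{x}_k$ is called a linear combination
of vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_k$.


Example 5.2 Let there be given the vectors

\[ \mathbf{x}_1 = \begin{bmatrix} 3 \\ -2 \\ -1 \end{bmatrix}, \quad \mathbf{x}_2 = \begin{bmatrix} 4 \\ 3 \\ 0 \end{bmatrix}, \quad \mathbf{x}_3 = \begin{bmatrix} 5 \\ 3 \\ 7 \end{bmatrix}. \tag{5.6} \]

Then, the vector $\mathbf{x} = 2\mathbf{x}_1 - 3\mathbf{x}_2 + \mathbf{x}_3 = \begin{bmatrix} -1 \\ -10 \\ 5 \end{bmatrix}$ is a linear combination of the vectors
$\mathbf{x}_1, \mathbf{x}_2, \mathbf{x}_3$.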
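
The computation in Example 5.2 can be reproduced in a few lines. This is a minimal sketch in plain Python; the function name `linear_combination` is an illustrative assumption, and vectors are represented as tuples of coordinates.

```python
def linear_combination(coeffs, vectors):
    """Return a1*x1 + a2*x2 + ... + ak*xk, computed coordinate-wise."""
    if len(coeffs) != len(vectors):
        raise ValueError("one coefficient per vector is required")
    n = len(vectors[0])
    return tuple(sum(c * v[i] for c, v in zip(coeffs, vectors))
                 for i in range(n))


x1, x2, x3 = (3, -2, -1), (4, 3, 0), (5, 3, 7)
x = linear_combination((2, -3, 1), (x1, x2, x3))
assert x == (-1, -10, 5)  # the vector obtained in Example 5.2
```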

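A system of vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_k$ is referred to as linearly independent
if from the equality $\alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_k\cdot\mathbf{x}_k = \mathbf{0}$ it follows that $\alpha_1 = \alpha_2 = \cdots = \alpha_k = 0$.
If there exists a set of real numbers $\alpha_1, \alpha_2, \ldots, \alpha_k$, among which at least one is
not equal to zero, such that $\alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_k\cdot\mathbf{x}_k = \mathbf{0}$, then the system of vectors is referred to as linearly dependent.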


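Example 5.3 Given are the vectors $\mathbf{x}_1 = (2, -3)$ and $\mathbf{x}_2 = (4, 5)$. Show that they
are linearly independent.
Find the solution of the system of equations

\[ \alpha_1 \begin{bmatrix} 2 \\ -3 \end{bmatrix} + \alpha_2 \begin{bmatrix} 4 \\ 5 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}, \tag{5.7} \]

that is,

\[ \begin{cases} \alpha_1\cdot 2 + \alpha_2\cdot 4 = 0, \\ \alpha_1\cdot(-3) + \alpha_2\cdot 5 = 0. \end{cases} \tag{5.8} \]

Since this system has the unique solution $\alpha_1 = \alpha_2 = 0$, the vectors $\mathbf{x}_1$ and $\mathbf{x}_2$ are
linearly independent.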
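
Linear independence can also be tested mechanically: a system of vectors is linearly independent exactly when the rank of the matrix of their coordinates equals the number of vectors. The sketch below is one possible way to do this with exact rational arithmetic; the names `rank` and `is_linearly_independent` are assumptions made for illustration.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (list of rows) found by Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0  # number of pivot rows found so far
    n_cols = len(m[0]) if m else 0
    for col in range(n_cols):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
        if r == len(m):
            break
    return r


def is_linearly_independent(vectors):
    """True if only the trivial linear combination gives the zero vector."""
    return rank(vectors) == len(vectors)


assert is_linearly_independent([(2, -3), (4, 5)])      # Example 5.3
assert not is_linearly_independent([(1, 2), (2, 4)])   # proportional vectors
```

Any three vectors in the plane fail the same test, for example `is_linearly_independent([(2, -3), (4, 5), (1, 1)])` returns `False`, because the rank cannot exceed the number of coordinates.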


Note Assume that the vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_k$ are linearly dependent. Then, at least
one of the coefficients $\alpha_i$ is other than zero (for example, $\alpha_1 \neq 0$). In this case, we
can write

\[ \mathbf{x}_1 = -\frac{\alpha_2}{\alpha_1}\mathbf{x}_2 - \frac{\alpha_3}{\alpha_1}\mathbf{x}_3 - \cdots - \frac{\alpha_k}{\alpha_1}\mathbf{x}_k. \tag{5.9} \]

Thus, if the vectors are linearly dependent, then one of them is linearly expressed
in terms of all others [30]. The converse is also true: if one of the vectors of the set is
linearly expressed in terms of the others, then these vectors are linearly dependent.
The last property can be considered as a definition of linear dependence of vectors.


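5.2 Basis in the Space $\mathbb{R}^n$


Prior to introducing the concept of basis in the vector space $\mathbb{R}^n$, let us prove the
following theorem.


Theorem 5.1 Any system of $n + 1$ vectors in the space $\mathbb{R}^n$ is linearly dependent.


Proof Take arbitrary $n + 1$ vectors

\[ \mathbf{x}_i = (x_{1i}, x_{2i}, \ldots, x_{ni}), \tag{5.10} \]

where $i = 1, 2, \ldots, n + 1$.
Construct their linear combination and equate it to the zero vector:

\[ \alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_{n+1}\cdot\mathbf{x}_{n+1} = \mathbf{0}. \tag{5.11} \]

Writing this equality in a coordinate-wise manner, we arrive at a system of $n$
equations with $n + 1$ unknowns $\alpha_1, \alpha_2, \ldots, \alpha_{n+1}$:

\[ \begin{cases} \alpha_1\cdot x_{11} + \alpha_2\cdot x_{12} + \cdots + \alpha_{n+1}\cdot x_{1,n+1} = 0, \\ \alpha_1\cdot x_{21} + \alpha_2\cdot x_{22} + \cdots + \alpha_{n+1}\cdot x_{2,n+1} = 0, \\ \dots \\ \alpha_1\cdot x_{n1} + \alpha_2\cdot x_{n2} + \cdots + \alpha_{n+1}\cdot x_{n,n+1} = 0. \end{cases} \tag{5.12} \]

The matrix of the system (5.12) differs from the respective augmented matrix
only in the zero column; this is why their ranks coincide and the system is consistent.
Since this rank does not exceed $n$, which is less than the number of unknowns $n + 1$,
according to the Kronecker–Capelli theorem, the system has infinitely many solutions. They
necessarily include a non-zero solution. Thus, there exists a non-zero set of coefficients
$\alpha_1, \alpha_2, \ldots, \alpha_{n+1}$ for which the linear combination of vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_{n+1}$ is
equal to the zero vector. Therefore, the vectors $\mathbf{x}_i$, where $1 \le i \le n + 1$, are
linearly dependent.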

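Any system of $n$ linearly independent vectors $\mathbf{b}_1, \mathbf{b}_2, \ldots, \mathbf{b}_n$ is called a basis of
the vector space $\mathbb{R}^n$.

Consider in the space $\mathbb{R}^n$ the system of vectors:

\[ \begin{aligned} \mathbf{e}_1 &= (1, 0, \ldots, 0), \\ \mathbf{e}_2 &= (0, 1, \ldots, 0), \\ &\dots \\ \mathbf{e}_n &= (0, 0, \ldots, 1). \end{aligned} \tag{5.13} \]

These vectors are linearly independent, since from the condition

\[ \alpha_1\cdot\mathbf{e}_1 + \cdots + \alpha_n\cdot\mathbf{e}_n = \mathbf{0} \tag{5.14} \]

directly follows the system of equalities $\alpha_1 = \alpha_2 = \cdots = \alpha_n = 0$.
The vectors $\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_n$ are called normalized vectors of the space $\mathbb{R}^n$; they
form the basis in this space.

Conclusion: a linearly independent system of vectors in $\mathbb{R}^n$ can have a maximum
of $n$ vectors.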

Consider a system of $n$ vectors

\[ \mathbf{x}_i = (x_{1i}, x_{2i}, \ldots, x_{ni}), \text{ where } i = 1, 2, \ldots, n. \tag{5.15} \]

Construct a matrix of coordinates of the vectors:

\[ \begin{bmatrix} x_{11} & x_{12} & \cdots & x_{1n} \\ x_{21} & x_{22} & \cdots & x_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n1} & x_{n2} & \cdots & x_{nn} \end{bmatrix}. \tag{5.16} \]

Such a matrix is called a matrix of a system of vectors, and its determinant is
called a determinant of a system of vectors.


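Theorem 5.2 In order for a system of $n$ vectors to be the basis, it is necessary and
sufficient that the determinant of the system is other than zero.


Proof Consider an arbitrary system of $n$ vectors

\[ \mathbf{x}_i = (x_{1i}, x_{2i}, \ldots, x_{ni}), \quad i = 1, 2, \ldots, n. \tag{5.17} \]

Construct their linear combination and equate it to the zero vector:

\[ \alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_n\cdot\mathbf{x}_n = \begin{bmatrix} \alpha_1\cdot x_{11} + \alpha_2\cdot x_{12} + \cdots + \alpha_n\cdot x_{1n} \\ \alpha_1\cdot x_{21} + \alpha_2\cdot x_{22} + \cdots + \alpha_n\cdot x_{2n} \\ \vdots \\ \alpha_1\cdot x_{n1} + \alpha_2\cdot x_{n2} + \cdots + \alpha_n\cdot x_{nn} \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \end{bmatrix}. \tag{5.18} \]

We obtain a homogeneous system of $n$ equations with $n$ unknowns. If its
determinant is other than zero, such a system has only the zero solution, i.e.
$\alpha_i = 0$, where $i = 1, 2, \ldots, n$, and the system of vectors is a basis. Conversely,
if the determinant is equal to zero, the homogeneous system has a non-zero solution,
so the vectors are linearly dependent and do not form a basis.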


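Theorem 5.3 Assume that the vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n$ form a basis. Then, any
vector $\mathbf{y}$ of $\mathbb{R}^n$ can be represented, uniquely, in the form of a linear combination
of the vectors $\mathbf{x}_i$ ($i = 1, 2, \ldots, n$):

\[ \mathbf{y} = \alpha_1\cdot\mathbf{x}_1 + \alpha_2\cdot\mathbf{x}_2 + \cdots + \alpha_n\cdot\mathbf{x}_n. \tag{5.19} \]

Proof Write the expansion (5.19) coordinate-wise:

\[ \alpha_1\mathbf{x}_1 + \alpha_2\mathbf{x}_2 + \cdots + \alpha_n\mathbf{x}_n = \begin{bmatrix} \alpha_1\cdot x_{11} + \alpha_2\cdot x_{12} + \cdots + \alpha_n\cdot x_{1n} \\ \alpha_1\cdot x_{21} + \alpha_2\cdot x_{22} + \cdots + \alpha_n\cdot x_{2n} \\ \vdots \\ \alpha_1\cdot x_{n1} + \alpha_2\cdot x_{n2} + \cdots + \alpha_n\cdot x_{nn} \end{bmatrix} = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{bmatrix}. \tag{5.20} \]

We have obtained a non-homogeneous system of $n$ equations with $n$ unknowns.
Since the determinant of this system is other than zero, by virtue of Cramer's rule,
this system has a unique solution.


Note The formula (5.19) is called an expansion of the vector $\mathbf{y}$ in the basis $\mathbf{x}_i$ ($i = 1, 2, \ldots, n$).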


Example 5.4 Show that the vectors $\mathbf{a} = (1, 1, 4)$, $\mathbf{b} = (0, -3, 2)$ and $\mathbf{c} = (2, 1, -1)$ form a basis.
Construct the determinant of the system of vectors and compute it:

\[ \begin{vmatrix} 1 & 0 & 2 \\ 1 & -3 & 1 \\ 4 & 2 & -1 \end{vmatrix} = 1\cdot(3 - 2) + 2\cdot(2 + 12) = 29. \tag{5.21} \]

Since the determinant of the system is other than zero, the vectors $\mathbf{a}$, $\mathbf{b}$ and $\mathbf{c}$
form a basis.


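Example 5.5 Expand the vector $\mathbf{d} = (6, 5, -14)$ in the basis $(\mathbf{a}, \mathbf{b}, \mathbf{c})$ (see previous
example).
Represent the vector $\mathbf{d}$ in the form of the expansion:

\[ \mathbf{d} = \alpha_1\cdot\mathbf{a} + \alpha_2\cdot\mathbf{b} + \alpha_3\cdot\mathbf{c}. \tag{5.22} \]

We have

\[ \begin{bmatrix} 6 \\ 5 \\ -14 \end{bmatrix} = \alpha_1 \begin{bmatrix} 1 \\ 1 \\ 4 \end{bmatrix} + \alpha_2 \begin{bmatrix} 0 \\ -3 \\ 2 \end{bmatrix} + \alpha_3 \begin{bmatrix} 2 \\ 1 \\ -1 \end{bmatrix}. \tag{5.23} \]

Write this equality in the form of the system of linear equations:

\[ \begin{cases} \alpha_1 + 2\alpha_3 = 6, \\ \alpha_1 - 3\alpha_2 + \alpha_3 = 5, \\ 4\alpha_1 + 2\alpha_2 - \alpha_3 = -14. \end{cases} \tag{5.24} \]

The obtained system has the unique solution $\alpha_1 = -2$, $\alpha_2 = -1$, $\alpha_3 = 4$. Thus,

\[ \mathbf{d} = -2\mathbf{a} - \mathbf{b} + 4\mathbf{c}. \]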
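
Examples 5.4 and 5.5 can be reproduced with Cramer's rule, which the proof of Theorem 5.3 relies on. The following sketch is restricted to three-dimensional vectors and uses exact fractions; the names `det3` and `expand_in_basis` are hypothetical helpers introduced here for illustration.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3 x 3 matrix, expanded along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def expand_in_basis(basis, y):
    """Coefficients of y in the given basis of R^3, found by Cramer's rule."""
    # The basis vectors are the columns of the system matrix.
    m = [[basis[j][i] for j in range(3)] for i in range(3)]
    d = det3(m)
    if d == 0:
        raise ValueError("the vectors do not form a basis")
    coeffs = []
    for k in range(3):
        mk = [row[:] for row in m]
        for i in range(3):
            mk[i][k] = y[i]  # replace the k-th column by the right-hand side
        coeffs.append(Fraction(det3(mk), d))
    return coeffs


a, b, c = (1, 1, 4), (0, -3, 2), (2, 1, -1)
assert det3([[1, 0, 2], [1, -3, 1], [4, 2, -1]]) == 29        # Example 5.4
assert expand_in_basis((a, b, c), (6, 5, -14)) == [-2, -1, 4]  # Example 5.5
```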
5.3 Euclidean Vector Space


In an arbitrary n-dimensional vector space, it is possible to introduce a scalar
product (known also as inner product, or dot product), i.e. the rule according to
which the two vectors $\mathbf{a}, \mathbf{b} \in \mathbb{R}^n$ are associated with the number $(\mathbf{a}\cdot\mathbf{b})$. The scalar
product suggests such analogues of a spatial arrangement of the multidimensional
vectors of $\mathbb{R}^n$ as orthogonality and collinearity.

By definition, the scalar product of the vectors $\mathbf{a} = (a_1, a_2, \ldots, a_n)$ and $\mathbf{b} = (b_1, b_2, \ldots, b_n)$ is computed by the formula:

\[ (\mathbf{a}\cdot\mathbf{b}) = a_1b_1 + a_2b_2 + \cdots + a_nb_n. \tag{5.25} \]

Note that with the help of the summation sign, the quantity $(\mathbf{a}\cdot\mathbf{b})$ is compactly
written as $(\mathbf{a}\cdot\mathbf{b}) = \sum_{i=1}^{n} a_ib_i$.


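Let us enumerate the properties of a scalar product.
For arbitrary $\mathbf{a}, \mathbf{a}_1, \mathbf{a}_2, \mathbf{b} \in \mathbb{R}^n$ and $\alpha \in \mathbb{R}$, the following relations are valid:

(1) $(\mathbf{a}\cdot\mathbf{b}) = (\mathbf{b}\cdot\mathbf{a})$ (symmetry);

(2) $((\mathbf{a}_1 + \mathbf{a}_2)\cdot\mathbf{b}) = (\mathbf{a}_1\cdot\mathbf{b}) + (\mathbf{a}_2\cdot\mathbf{b})$ (linearity);

(3) $(\alpha\mathbf{a}\cdot\mathbf{b}) = \alpha(\mathbf{a}\cdot\mathbf{b})$ (linearity);

(4) $(\mathbf{a}\cdot\mathbf{a}) \ge 0$, and $(\mathbf{a}\cdot\mathbf{a}) = 0 \Leftrightarrow \mathbf{a} = \mathbf{0}$ (non-negativity).


Note For the scalar product of the vectors $\mathbf{a}$ and $\mathbf{b}$, the designations $(\mathbf{a}, \mathbf{b})$, $\mathbf{a}\cdot\mathbf{b}$ or
$\mathbf{a}\mathbf{b}$ are also used.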


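Example 5.6 Let $n = 4$, and in the coordinate notation, the vectors have the form
$\mathbf{a} = (10, -2, 1, 9)$, $\mathbf{b} = (0, 3, 4, -2)$ and $\mathbf{c} = (-12, 2, -4, -5)$. Then,

\[ \mathbf{a}\cdot\mathbf{b} = 10\cdot 0 + (-2)\cdot 3 + 1\cdot 4 + 9\cdot(-2) = -20, \]
\[ \mathbf{a}\cdot\mathbf{c} = 10\cdot(-12) + (-2)\cdot 2 + 1\cdot(-4) + 9\cdot(-5) = -173, \]
\[ \mathbf{b}\cdot\mathbf{c} = 0\cdot(-12) + 3\cdot 2 + 4\cdot(-4) + (-2)\cdot(-5) = 0. \]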
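
Formula (5.25) translates directly into code. The sketch below, with the assumed helper name `dot`, recomputes the three products of Example 5.6 and checks the symmetry and linearity properties on the same vectors.

```python
def dot(a, b):
    """Scalar product (5.25) of two vectors of equal dimension."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of coordinates")
    return sum(x * y for x, y in zip(a, b))


a = (10, -2, 1, 9)
b = (0, 3, 4, -2)
c = (-12, 2, -4, -5)

assert dot(a, b) == -20
assert dot(a, c) == -173
assert dot(b, c) == 0            # b and c have zero scalar product
assert dot(a, b) == dot(b, a)    # symmetry

# Linearity in the first argument: ((a + c) . b) = (a . b) + (c . b).
a_plus_c = tuple(x + y for x, y in zip(a, c))
assert dot(a_plus_c, b) == dot(a, b) + dot(c, b)
```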


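Example 5.7 Show that if the condition $(\mathbf{a}\cdot\mathbf{t}) = (\mathbf{b}\cdot\mathbf{t})$ is valid for all $\mathbf{t} \in \mathbb{R}^n$, then
the vectors $\mathbf{a}$ and $\mathbf{b}$ are equal to each other.


Proof Based on the property of linearity, represent the equality $(\mathbf{a}\cdot\mathbf{t}) = (\mathbf{b}\cdot\mathbf{t})$ in
the equivalent form

\[ (\mathbf{a}\cdot\mathbf{t}) = (\mathbf{b}\cdot\mathbf{t}) \Leftrightarrow ((\mathbf{a} - \mathbf{b})\cdot\mathbf{t}) = 0. \tag{5.26} \]

Into the obtained equality, substitute $\mathbf{t} = \mathbf{a} - \mathbf{b}$. Then $((\mathbf{a} - \mathbf{b})\cdot(\mathbf{a} - \mathbf{b})) = 0$ and, according to the property
of non-negativity, we have $\mathbf{a} - \mathbf{b} = \mathbf{0}$, or $\mathbf{a} = \mathbf{b}$, which is what we set out to
prove.

Length, or norm, of a vector is the value $\|\mathbf{a}\| = \sqrt{(\mathbf{a}\cdot\mathbf{a})}$.
Note the following easily provable properties of the norm, which are valid for
arbitrary vectors $\mathbf{a}$ and $\mathbf{b}$ of Euclidean space:

(1) $\|\mathbf{a}\| \ge 0$, and $\|\mathbf{a}\| = 0 \Leftrightarrow \mathbf{a} = \mathbf{0}$;
(2) $\|\alpha\mathbf{a}\| = |\alpha|\,\|\mathbf{a}\|$ for all $\alpha \in \mathbb{R}$;
(3) $\|\mathbf{a} + \mathbf{b}\| \le \|\mathbf{a}\| + \|\mathbf{b}\|$.

The last inequality is referred to as a triangle inequality or Minkowski²
inequality.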


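Example 5.8 For the vectors $\mathbf{a} = (0, -1, -1, 3, 1)$ and $\mathbf{b} = (5, -3, 0, -2, -1)$ in
the space $\mathbb{R}^5$, we have

\[ \|\mathbf{a}\| = \sqrt{(\mathbf{a}\cdot\mathbf{a})} = \sqrt{0^2 + (-1)^2 + (-1)^2 + 3^2 + 1^2} = \sqrt{12}; \]
\[ \|\mathbf{b}\| = \sqrt{(\mathbf{b}\cdot\mathbf{b})} = \sqrt{5^2 + (-3)^2 + 0^2 + (-2)^2 + (-1)^2} = \sqrt{39}. \]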


Orthogonal are the vectors whose scalar product is equal to zero. Usually, the
orthogonal vectors are designated as $\mathbf{a} \perp \mathbf{b}$.

A set of vectors, where all vectors are pairwise orthogonal, is naturally called
orthogonal. If in such a set all vectors have a unit norm, then such a set is
orthonormal.

Of course, in an arbitrary basis, the vectors might not possess the property of
pairwise orthogonality, a fortiori orthonormality. Let us show that it is possible
to construct a new basis from the original one so that in the new basis all the vectors
are pairwise orthogonal. Such a procedure is called the Gram³–Schmidt⁴
process (orthogonalization).

Assume that in a Euclidean vector space with a norm, a basis $(\mathbf{p}_1, \mathbf{p}_2, \ldots, \mathbf{p}_n)$ is
given. The procedure of constructing a new orthonormal basis consists in performing
the following steps.


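Successively compute the vectors $\mathbf{q}_1, \mathbf{q}_2, \ldots, \mathbf{q}_n$ by the formulae:

\[ \mathbf{t}_1 = \mathbf{p}_1, \quad \mathbf{q}_1 = \frac{\mathbf{t}_1}{\|\mathbf{t}_1\|}, \]
\[ \mathbf{t}_2 = \mathbf{p}_2 - (\mathbf{p}_2, \mathbf{q}_1)\mathbf{q}_1, \quad \mathbf{q}_2 = \frac{\mathbf{t}_2}{\|\mathbf{t}_2\|}, \]
\[ \mathbf{t}_3 = \mathbf{p}_3 - (\mathbf{p}_3, \mathbf{q}_1)\mathbf{q}_1 - (\mathbf{p}_3, \mathbf{q}_2)\mathbf{q}_2, \quad \mathbf{q}_3 = \frac{\mathbf{t}_3}{\|\mathbf{t}_3\|}, \]
\[ \dots \]
\[ \mathbf{t}_n = \mathbf{p}_n - (\mathbf{p}_n, \mathbf{q}_1)\mathbf{q}_1 - \cdots - (\mathbf{p}_n, \mathbf{q}_{n-1})\mathbf{q}_{n-1}, \quad \mathbf{q}_n = \frac{\mathbf{t}_n}{\|\mathbf{t}_n\|}. \]

² Hermann Minkowski (1864–1909), German mathematician.
³ Jørgen Pedersen Gram (1850–1916), Danish mathematician.
⁴ Erhard Schmidt (1876–1959), German mathematician.

The obtained basis $(\mathbf{q}_1, \mathbf{q}_2, \ldots, \mathbf{q}_n)$ is orthonormal.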
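
The Gram–Schmidt steps can be sketched as a short routine. The version below follows the formulae for $\mathbf{t}_k$ and $\mathbf{q}_k$ literally (the classical variant) in floating-point arithmetic; the function names and the tolerance `1e-12` are assumptions made for this illustration. It is applied to the vectors of Problem 5.3.

```python
import math


def dot(a, b):
    """Scalar product of two vectors of equal dimension."""
    return sum(x * y for x, y in zip(a, b))


def gram_schmidt(basis):
    """Return an orthonormal basis q_1, ..., q_n built from p_1, ..., p_n."""
    qs = []
    for p in basis:
        t = list(p)
        for q in qs:
            coeff = dot(p, q)  # the coefficient (p_k, q_j)
            t = [ti - coeff * qi for ti, qi in zip(t, q)]
        norm = math.sqrt(dot(t, t))
        if norm < 1e-12:
            raise ValueError("the vectors are linearly dependent")
        qs.append([ti / norm for ti in t])
    return qs


qs = gram_schmidt([(1, 2, 3), (-3, -2, 3), (0, -2, -2)])
for i, qi in enumerate(qs):
    for j, qj in enumerate(qs):
        expected = 1.0 if i == j else 0.0
        assert abs(dot(qi, qj) - expected) < 1e-9
```

For bases that are close to linearly dependent, the classical variant can lose orthogonality in floating point; subtracting the projections from the running vector `t` instead of `p` (the modified variant) is a common remedy.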
5.4 Eigenvalues and Eigenvectors of a Matrix 
Two matrices $A$ and $A'$, bound by the relation

\[ A' = P^{-1}AP, \tag{5.27} \]

where $P$ is some invertible matrix, are referred to as similar. In this case, the
designation $A' \sim A$ is used.


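Note Transition from the matrix $A$ to $A'$ is called similarity transformation.


Example 5.9 Matrices $\begin{bmatrix} 1 & -3 \\ -1 & 2 \end{bmatrix}$ and $\begin{bmatrix} -2 & 1 \\ -9 & 5 \end{bmatrix}$ are similar, since the following
equality is valid:

\[ \begin{bmatrix} -2 & 1 \\ -9 & 5 \end{bmatrix} = \begin{bmatrix} 1 & -1 \\ -2 & 1 \end{bmatrix}^{-1} \begin{bmatrix} 1 & -3 \\ -1 & 2 \end{bmatrix} \begin{bmatrix} 1 & -1 \\ -2 & 1 \end{bmatrix}. \tag{5.28} \]

Indeed, $\begin{bmatrix} 1 & -1 \\ -2 & 1 \end{bmatrix}^{-1} = \begin{bmatrix} -1 & -1 \\ -2 & -1 \end{bmatrix}$, and the equality (5.28) is easily verified by
a direct multiplication of the matrices.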
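
The direct multiplication mentioned in Example 5.9 can be carried out as follows. This is a sketch for 2 x 2 matrices with exact fractions; `matmul` and `inverse_2x2` are illustrative helper names, not functions from the text.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def inverse_2x2(p):
    """Inverse of a 2 x 2 matrix via the adjugate formula."""
    det = p[0][0] * p[1][1] - p[0][1] * p[1][0]
    if det == 0:
        raise ValueError("matrix is singular")
    d = Fraction(det)
    return [[p[1][1] / d, -p[0][1] / d], [-p[1][0] / d, p[0][0] / d]]


A = [[1, -3], [-1, 2]]
P = [[1, -1], [-2, 1]]

assert inverse_2x2(P) == [[-1, -1], [-2, -1]]
A_prime = matmul(matmul(inverse_2x2(P), A), P)
assert A_prime == [[-2, 1], [-9, 5]]  # equality (5.28)
```

Both matrices have determinant $-1$ and trace $3$, which is consistent with their similarity.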


Theorem 5.4 (On the Matrix Similarity Properties) The following properties of 
similarity are valid: 


(1) $A \sim A$ (reflexivity);
(2) $A \sim B \Rightarrow B \sim A$ (symmetry);
(3) $((A \sim B)$ and $(B \sim C)) \Rightarrow (A \sim C)$ (transitivity).


It follows from the theorem on the matrix similarity properties that the similarity
of matrices is an equivalence relation [1, 41, 53].
Two similar matrices have equal determinants. Indeed, from the definition
(5.27), it follows that

\[ |A'| = |P^{-1}AP| = |P^{-1}|\,|A|\,|P| = |A|. \tag{5.29} \]

Note that the equality of the determinants does not at all imply the similarity of
the matrices.


0 
Example 5.10 Find out whether the following matrices are similar: and 


10 
iti 


The determinants of these matrices are equal to —1 and 1, respectively. Therefore, 
these matrices do not possess the property of similarity. 


The number $\lambda$ and the non-zero vector $\mathbf{b}$ are referred to as eigenvalue and
eigenvector of the matrix $A$, respectively, if the following equality is valid:

\[ A\mathbf{b} = \lambda\mathbf{b}. \tag{5.30} \]

The vector $\mathbf{b}$ is considered as a column vector. In order to find $\mathbf{b}$ and $\lambda$, represent
Eq. (5.30) in the following form:

\[ (A - \lambda I)\mathbf{b} = \mathbf{0}, \tag{5.31} \]

where $I$ is an identity matrix.

We have obtained a homogeneous system of linear equations. In order for it to
have a non-zero solution, it is necessary and sufficient that the determinant of the
matrix $A - \lambda I$ is equal to zero. Thus, in order to find $\lambda$, we should solve the equation

\[ |A - \lambda I| = 0 \tag{5.32} \]

or, in an expanded notation,

\[ \begin{vmatrix} a_{11} - \lambda & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} - \lambda & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nn} - \lambda \end{vmatrix} = 0. \tag{5.33} \]

This equation is referred to as characteristic equation.
If we expand the determinant, we obtain a polynomial of degree $n$ in the
variable $\lambda$:

\[ p(\lambda) = (-\lambda)^n + h_1(-\lambda)^{n-1} + \cdots + h_{n-1}(-\lambda) + h_n, \tag{5.34} \]

and the following properties are valid:

• the coefficient $h_1$ is equal to the trace of the matrix $A$: $h_1 = \operatorname{tr} A = a_{11} + a_{22} + \cdots + a_{nn}$;
• the constant term $h_n$ coincides with the determinant: $h_n = \det A$.

The polynomial (5.34) is also referred to as characteristic.

It is known that characteristic polynomials of similar matrices coincide.

According to the fundamental theorem of algebra, the characteristic equation (5.32)
has no more than $n$ different solutions. Each solution $\lambda$ is associated with
an eigenvector $\mathbf{b}$.


Note Although the eigenvalue can be equal to zero, the eigenvector, by definition, 
is always different from the zero vector. 


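Example 5.11 Find the eigenvalues and eigenvectors of the matrix

\[ A = \begin{bmatrix} 1 & 1 & 3 \\ 1 & 5 & 1 \\ 3 & 1 & 1 \end{bmatrix}. \tag{5.35} \]

Compute the determinant of the matrix $A - \lambda I$ and equate it to zero:

\[ |A - \lambda I| = \begin{vmatrix} 1-\lambda & 1 & 3 \\ 1 & 5-\lambda & 1 \\ 3 & 1 & 1-\lambda \end{vmatrix} \]
\[ = (1-\lambda)\cdot[(5-\lambda)\cdot(1-\lambda) - 1] - (1-\lambda) + 3 + 3\cdot[1 - 3\cdot(5-\lambda)] \]
\[ = (1-\lambda)\cdot(\lambda^2 - 6\cdot\lambda + 3) + 9\cdot\lambda - 39 = -\lambda^3 + 7\cdot\lambda^2 - 36 \]
\[ = -(\lambda + 2)\cdot(\lambda^2 - 9\cdot\lambda + 18) = 0. \]

Solving this equation, we obtain three roots, $\lambda_1 = -2$, $\lambda_2 = 6$ and $\lambda_3 = 3$.
For each $\lambda$, find the eigenvector associated with it.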


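1. Let $\lambda_1 = -2$. Then,

\[ A - \lambda_1 I = \begin{bmatrix} 3 & 1 & 3 \\ 1 & 7 & 1 \\ 3 & 1 & 3 \end{bmatrix}. \tag{5.36} \]

Assuming that $\mathbf{b} = (x, y, z)$, we obtain the system of equations

\[ \begin{cases} 3x + y + 3z = 0, \\ x + 7y + z = 0, \\ 3x + y + 3z = 0. \end{cases} \tag{5.37} \]

Write the matrix that corresponds to this system:

\[ \begin{bmatrix} 3 & 1 & 3 \\ 1 & 7 & 1 \\ 3 & 1 & 3 \end{bmatrix}. \tag{5.38} \]

Add to the third row of this matrix the first one, multiplied by $(-1)$:

\[ \begin{bmatrix} 3 & 1 & 3 \\ 1 & 7 & 1 \\ 0 & 0 & 0 \end{bmatrix}. \tag{5.39} \]

Drop the zero row and exchange places of the second and the first rows. Then,
we have

\[ \begin{bmatrix} 1 & 7 & 1 \\ 3 & 1 & 3 \end{bmatrix}. \tag{5.40} \]

Bring the matrix to echelon form; add to the second row the first one,
multiplied by $-3$. We obtain

\[ \begin{bmatrix} 1 & 7 & 1 \\ 0 & -20 & 0 \end{bmatrix}. \tag{5.41} \]

Proceed to the equations and write

\[ \begin{cases} x + 7y + z = 0, \\ y = 0, \end{cases} \tag{5.42} \]

or $x + z = 0$.
As a free variable, select $z$. Then, assume that $z = 1$, and then $x = -1$.

Thus, $\mathbf{b}_1 = (-1, 0, 1)$.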




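2. For $\lambda_2 = 6$, we obtain the system:

\[ \begin{cases} -5x + y + 3z = 0, \\ x - y + z = 0, \\ 3x + y - 5z = 0. \end{cases} \tag{5.43} \]

In order to find the vector $\mathbf{b}_2$, write the matrix of this system and its
transformations:

\[ \begin{bmatrix} -5 & 1 & 3 \\ 1 & -1 & 1 \\ 3 & 1 & -5 \end{bmatrix} \to \begin{bmatrix} 1 & -1 & 1 \\ -5 & 1 & 3 \\ 3 & 1 & -5 \end{bmatrix} \to \begin{bmatrix} 1 & -1 & 1 \\ 0 & -4 & 8 \\ 0 & 4 & -8 \end{bmatrix} \to \begin{bmatrix} 1 & -1 & 1 \\ 0 & 1 & -2 \end{bmatrix}. \]

Proceed to the system of equations:

\[ \begin{cases} x - y + z = 0, \\ y = 2z. \end{cases} \tag{5.44} \]

Assuming that $z = 1$, we find $y = 2$ and $x = 1$.
Thus, $\mathbf{b}_2 = (1, 2, 1)$.
3. For $\lambda_3 = 3$, we have the system:

\[ \begin{cases} -2x + y + 3z = 0, \\ x + 2y + z = 0, \\ 3x + y - 2z = 0. \end{cases} \tag{5.45} \]

In order to find the vector $\mathbf{b}_3$, we perform similar equivalent transformations:

\[ \begin{bmatrix} -2 & 1 & 3 \\ 1 & 2 & 1 \\ 3 & 1 & -2 \end{bmatrix} \to \begin{bmatrix} 1 & 2 & 1 \\ -2 & 1 & 3 \\ 3 & 1 & -2 \end{bmatrix} \to \begin{bmatrix} 1 & 2 & 1 \\ 0 & 5 & 5 \\ 0 & -5 & -5 \end{bmatrix} \to \begin{bmatrix} 1 & 2 & 1 \\ 0 & 1 & 1 \end{bmatrix}. \]

Proceed to the system of equations:

\[ \begin{cases} x + 2y = -z, \\ y = -z. \end{cases} \tag{5.46} \]

Assuming that $z = 1$, we find $y = -1$ and $x = 1$.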




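Thus, the matrix $A$ has the following eigenvalues and eigenvectors corresponding to them:

\[ \lambda_1 = -2, \quad \mathbf{b}_1 = (-1, 0, 1), \tag{5.47} \]
\[ \lambda_2 = 6, \quad \mathbf{b}_2 = (1, 2, 1), \tag{5.48} \]
\[ \lambda_3 = 3, \quad \mathbf{b}_3 = (1, -1, 1). \tag{5.49} \]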
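
The result of Example 5.11 is easy to check against definition (5.30) and against the characteristic polynomial. The sketch below uses plain integer arithmetic; `matvec`, `is_eigenpair` and `char_poly` are illustrative names, and `char_poly` hard-codes the polynomial $-\lambda^3 + 7\lambda^2 - 36$ obtained in the example.

```python
def matvec(m, v):
    """Product of a square matrix and a column vector."""
    return tuple(sum(m[i][j] * v[j] for j in range(len(v)))
                 for i in range(len(m)))


def is_eigenpair(m, lam, b):
    """Check A b = lambda b for a non-zero vector b."""
    return any(b) and matvec(m, b) == tuple(lam * x for x in b)


def char_poly(lam):
    """Characteristic polynomial of the matrix A from Example 5.11."""
    return -lam ** 3 + 7 * lam ** 2 - 36


A = [[1, 1, 3], [1, 5, 1], [3, 1, 1]]
pairs = [(-2, (-1, 0, 1)), (6, (1, 2, 1)), (3, (1, -1, 1))]

for lam, b in pairs:
    assert is_eigenpair(A, lam, b)
    assert char_poly(lam) == 0

# The eigenvalues sum to the trace of A, in agreement with h_1 = tr A.
assert sum(lam for lam, _ in pairs) == A[0][0] + A[1][1] + A[2][2]
```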


Note In physics, the characteristic equation is sometimes referred to as the secular 
equation since such equations appeared during the analysis of the motion of the 
Solar system’s planets and their satellites over considerable periods of time (referred 
to as “secular” motions) [2]. 


Annihilating polynomial for the matrix $A$ is a polynomial $p(x)$ whose value
at this matrix is equal to the zero matrix: $p(A) = O$.


Theorem 5.5 (Cayley⁵–Hamilton⁶ Theorem) For any square matrix $A$, the characteristic
polynomial is its annihilating polynomial.


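Example 5.12 Let us illustrate the Cayley–Hamilton theorem with the help of the
characteristic polynomial of the matrix $A = \begin{bmatrix} 1 & -10 \\ -6 & 5 \end{bmatrix}$.

Indeed, $p(\lambda) = \det(A - \lambda I) = \begin{vmatrix} 1-\lambda & -10 \\ -6 & 5-\lambda \end{vmatrix} = \lambda^2 - 6\lambda - 55$.

Check whether the equality $p(A) = O$ is valid:

\[ p(A) = \begin{bmatrix} 1 & -10 \\ -6 & 5 \end{bmatrix}^2 - 6\begin{bmatrix} 1 & -10 \\ -6 & 5 \end{bmatrix} - 55\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 61 & -60 \\ -36 & 85 \end{bmatrix} + \begin{bmatrix} -6 & 60 \\ 36 & -30 \end{bmatrix} + \begin{bmatrix} -55 & 0 \\ 0 & -55 \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}. \]

Then, $p(\lambda)$ is an annihilating polynomial for the matrix $A$.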
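
For a 2 x 2 matrix the characteristic polynomial is $\lambda^2 - (\operatorname{tr} A)\lambda + \det A$, so the check from Example 5.12 can be repeated for any such matrix. The sketch below is limited to the 2 x 2 case; the function names are assumptions introduced for illustration.

```python
def matmul2(a, b):
    """Product of two 2 x 2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def cayley_hamilton_residual(a):
    """Return p(A) = A^2 - (tr A) A + (det A) I for a 2 x 2 matrix A."""
    trace = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    a2 = matmul2(a, a)
    return [[a2[i][j] - trace * a[i][j] + (det if i == j else 0)
             for j in range(2)] for i in range(2)]


A = [[1, -10], [-6, 5]]
assert matmul2(A, A) == [[61, -60], [-36, 85]]
assert cayley_hamilton_residual(A) == [[0, 0], [0, 0]]  # p(A) = O
```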


Recall that the similar matrices $A$ and $B$ are bound by the relation $B = P^{-1}AP$
for some nonsingular matrix $P$. Selecting $P$ composed of the columns equal to the
eigenvectors of $A$ (written in arbitrary order), we obtain the diagonal matrix $B$. This is
the point of the procedure of diagonalization of the initial matrix.

Note that a sufficient condition for the possibility of diagonalization is that
all eigenvalues of the initial matrix are different, i.e. their number
coincides with its order.

⁵ Arthur Cayley (1821–1895), English mathematician.
⁶ William Rowan Hamilton (1805–1865), Irish mathematician and physicist.


Review Questions


1. Define n-dimensional arithmetic vector.
2. Enumerate the basic operations of vectors.
3. What is n-dimensional arithmetic space?
4. How is a linear combination of a system of vectors constructed?
5. What system of vectors is referred to as linearly dependent?
6. Define basis of a vector space.
7. Formulate the criterion that an arbitrary system of vectors is the basis.
8. Explain how a scalar product of vectors can be introduced into a vector space.
9. Enumerate the basic properties of a scalar product.
10. How is the norm of a vector found?
11. What is the Gram–Schmidt orthogonalization procedure used for?
12. What two matrices are called similar?
13. Enumerate the properties of similarity of matrices.
14. Define the concepts of "eigenvector" and "eigenvalue" of a matrix.
15. How can one, knowing the elements of the matrix, set up its characteristic equation?
16. Formulate the Cayley–Hamilton theorem.
17. What is the sufficient condition of diagonalization of a matrix?


Problems 


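5.1. Find out whether the vectors $\mathbf{p}$, $\mathbf{q}$ and $\mathbf{r}$ form a basis in a three-dimensional
vector space. If they do, expand the vector $\mathbf{x}$ in this basis.

\[ (1)\quad \mathbf{p} = \begin{bmatrix} 2 \\ 1 \\ 0 \end{bmatrix},\ \mathbf{q} = \begin{bmatrix} 1 \\ 0 \\ 1 \end{bmatrix},\ \mathbf{r} = \begin{bmatrix} 4 \\ 2 \\ 1 \end{bmatrix},\ \mathbf{x} = \begin{bmatrix} 3 \\ 1 \\ 3 \end{bmatrix}. \tag{5.50} \]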
5 2 1 13 

(2) p=]1]/.94=]-1].r=| 0 |.x=] 2]. (5.51) 
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4 2 -1 —9 
(3) pP=|1]}.9g=] 0 }.r=] 2 |.*=] 5}. (5.52) 
1 —3 1 5 


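5.2. The vectors $\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ and $\mathbf{e}_4$ are specified by their coordinates in some basis.
Show that the vectors $\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ and $\mathbf{e}_4$ themselves form a basis, and find the
coordinates of the vector $\mathbf{x}$ in this basis:

\[ \mathbf{e}_1 = \begin{bmatrix} 1 \\ 2 \\ -1 \\ -2 \end{bmatrix},\ \mathbf{e}_2 = \begin{bmatrix} 2 \\ 3 \\ 0 \\ -1 \end{bmatrix},\ \mathbf{e}_3 = \begin{bmatrix} 1 \\ 2 \\ 1 \\ 4 \end{bmatrix},\ \mathbf{e}_4 = \begin{bmatrix} 1 \\ 3 \\ -1 \\ 0 \end{bmatrix},\ \mathbf{x} = \begin{bmatrix} 7 \\ 14 \\ -1 \\ 2 \end{bmatrix}. \tag{5.53} \]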


5.3. Check that the vectors $\mathbf{a} = (1, 2, 3)$, $\mathbf{b} = (-3, -2, 3)$ and $\mathbf{c} = (0, -2, -2)$
are linearly independent and thus form a basis. Is it orthogonal? Is it
orthonormal? If the answers are negative, then use the Gram–Schmidt
orthogonalization process to construct an orthonormal basis.

5.4. Check that the vectors $\mathbf{a} = (3, -1, 1, 2)$, $\mathbf{b} = (-3, 1, 0, -2)$, $\mathbf{c} = (0, -2, 2, -2)$ and $\mathbf{d} = (1, 4, 2, -7)$ form a basis in $\mathbb{R}^4$. Is it orthogonal?
Is it orthonormal? If the answers are negative, then use the Gram–Schmidt
orthogonalization process to construct an orthonormal basis.

5.5. Prove that the set of vectors $\{(i, 2-i, 5), (1, 2+i, -i), (1, i, -1)\}$ is a basis
in the vector space $\mathbb{C}^3$. What are the coordinates of the vectors $(1, 0, 0)$,
$(1, 1, 0)$ and $(1, 1, 1)$ in this basis?

5.6. Assume that the vectors $\mathbf{v}_1 = [a_1, a_2]^T$ and $\mathbf{v}_2 = [b_1, b_2]^T$ are linearly
independent. What can you say about the linear dependence or independence
of the vectors $\mathbf{w}_1 = [a_1, b_1]^T$ and $\mathbf{w}_2 = [a_2, b_2]^T$?

5.7. Prove that the set $M = \{M_{pq}\}$ of all matrices with $p$ rows and $q$ columns
with real elements forms a vector space relative to the operations of matrix
addition and matrix multiplication by a number.

5.8. Prove the Cauchy⁷–Bunyakovsky⁸ inequality (also referred to as the
Cauchy–Schwarz⁹ inequality):

For arbitrary vectors $\mathbf{x}, \mathbf{y} \in \mathbb{R}^n$, the following relation is valid:

\[ (\mathbf{x}\cdot\mathbf{y})^2 \le (\mathbf{x}\cdot\mathbf{x})(\mathbf{y}\cdot\mathbf{y}), \tag{5.54} \]

and the equality is valid if and only if the vectors $\mathbf{x}$ and $\mathbf{y}$ differ in the scalar
factor, i.e. are proportional.

5.9. Prove the Pythagorean¹⁰ theorem:

If the vectors $\mathbf{x}, \mathbf{y} \in \mathbb{R}^n$ are orthogonal, then the equality

\[ \|\mathbf{x} + \mathbf{y}\|^2 = \|\mathbf{x}\|^2 + \|\mathbf{y}\|^2 \tag{5.55} \]

is valid.

5.10. Check the validity of the identity

\[ \|\mathbf{x} + \mathbf{y}\|^2 + \|\mathbf{x} - \mathbf{y}\|^2 = 2(\|\mathbf{x}\|^2 + \|\mathbf{y}\|^2) \tag{5.56} \]

for arbitrary elements of the n-dimensional vector space $\mathbb{R}^n$. What is the
geometric sense of this identity in the spaces $\mathbb{R}^2$ and $\mathbb{R}^3$?

5.11. It is known that the equalities $\|\mathbf{x}\| = 6$, $\|\mathbf{x} + \mathbf{y}\| = 10$ and $\|\mathbf{x} - \mathbf{y}\| = 12$
are valid. What is the value of $\|\mathbf{y}\|$?

5.12. Find the maximum number of linearly independent vectors

(1) on the plane;
(2) in the three-dimensional space;
(3) in $\mathbb{R}^n$.

5.13. Check that the system of vectors
iT ts, all Oye cd AO Wee) oecss (0,0.08...91 7°

forms a basis in $\mathbb{R}^n$.

5.14. Is the system of vectors

(Udell. 1j8Sceccnls 123% cael, cou
eS ae an

a basis in $\mathbb{R}^n$?

5.15.
0

Show that the matrices and , where a ∈ R, are similar.
aQ 00

5.16. Find whether the matrices $A_1$ and $A_2$ are similar:

⁷ Augustin-Louis Cauchy (1789–1857), French mathematician and mechanician.
⁸ Viktor Yakovlevich Bunyakovsky (1804–1889), Russian mathematician and mathematics historian.
⁹ Karl Hermann Amandus Schwarz (1843–1921), German mathematician.

00}. 


? 


a 
(1) A; = and A> = 
0 0 11 


¹⁰ Pythagoras of Samos, Πυθαγόρας ὁ Σάμιος (about 570 B.C.–about 495 B.C.), Ancient Greek
philosopher and mathematician.

(2) $A_1 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{bmatrix}$ and $A_2 = \begin{bmatrix} 1 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{bmatrix}$.


5.17. Is it true that the traces of similar matrices coincide? 
5.18. Find the eigenvalues and eigenvectors of the matrix A. 


4-52 i 1 a 
GQ) A=|]5-73]; @ A=j)-4-1 0}; (5.57) 
6-94 lea Sec 
7 ae ae 0 10 
3) A=| 5 33 |: @ A=|-a4ol; (5.58) 
= a) 212 
1:a3°3 
(5) A=] -—2-613}]. (5.59) 
eee 


5.19. Diagonalize the matrix, i.e. bring the matrix to diagonal form:

\[ A = \begin{bmatrix} 4 & 15 & -3 \\ 8 & -3 & 3 \\ 0 & -15 & 7 \end{bmatrix}. \]


5.20. Bring the following matrices to diagonal form: 


(1) 
4 1 4 
A= 6 3.°6: 3 
ATT 5. 11 
(2) 
—~23 —16 —28 


A=] 58 39 64 
—11 —7 -10 
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5.21. Bring the matrix that depends on the real parameter a to diagonal form: 


010...0 0 
001...0 0 
Q=] oe. 
000...0 1 
w200...0 0 


*5.23. Using the Cayley-Hamilton theorem, compute the n-th power of the matrix 
Q, found in the previous problem. 
*5.24. Find the value of the limit: 


n 


1 
jini Geel |i. (5.60) 


noo —p/n 1 


5.25. Bring the complex matrix to diagonal form: 


1—2i 2i 2 
Z= 0 i 0 
i -—20 


5.26. It is known that two out of three eigenvalues of the matrix 


910 1013 + 571 —1013 + 57i 
Y=] 57-8991 68—1070i 57+ 1013i 
—57 — 899i 57 — 1013i 68 + 1070: 


are equal to 11 + 57i and 11 — 57i. Without solving the characteristic 
equation, find the third eigenvalue. 
5.27. Prove that the eigenvalues of Hermitian operator are real. 
*5.28. Prove that all eigenvalues of the unitary matrix lie in a complex plane, on a 
unit circle with the centre at the origin of coordinates. 
5.29. Compute the eigenvalues and eigenvectors of the Pauli matrices $\sigma_1$, $\sigma_2$ and
$\sigma_3$ (see page 185).


Answers and Solutions 


5.3 Solution. 
Write the matrix $A$ whose columns are the coordinates of the vectors $a$, $b$ and $c$:

\[
A = \begin{bmatrix} 1 & -3 & 0 \\ 2 & -2 & -2 \\ 3 & 3 & -2 \end{bmatrix}.
\]

Since $\det A = 16 \neq 0$, the vectors are linearly independent and form a basis. The basis is neither orthogonal, because, for example, $(a \cdot b) = 2 \neq 0$, nor orthonormal, because $\|a\| = \sqrt{14} \neq 1$.

In order to construct an orthonormal basis, apply the Gram-Schmidt algorithm:

\[
t_1 = a = (1, 2, 3), \qquad q_1 = \frac{t_1}{\|t_1\|} = \frac{1}{\sqrt{14}}(1, 2, 3);
\]

\[
t_2 = b - (b \cdot q_1) q_1 = (-3, -2, 3) - \frac{2}{14}(1, 2, 3) = \frac{2}{7}(-11, -8, 9), \qquad q_2 = \frac{t_2}{\|t_2\|} = \frac{1}{\sqrt{266}}(-11, -8, 9);
\]

\[
t_3 = c - (c \cdot q_1) q_1 - (c \cdot q_2) q_2 = (0, -2, -2) + \frac{10}{14}(1, 2, 3) + \frac{2}{266}(-11, -8, 9) = \frac{4}{19}(3, -3, 1), \qquad q_3 = \frac{t_3}{\|t_3\|} = \frac{1}{\sqrt{19}}(3, -3, 1).
\]

As a result, we obtain the orthonormal basis:

\[
q_1 = \frac{1}{\sqrt{14}}(1, 2, 3), \quad q_2 = \frac{1}{\sqrt{266}}(-11, -8, 9), \quad q_3 = \frac{1}{\sqrt{19}}(3, -3, 1).
\]
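The Gram-Schmidt steps of Solution 5.3 can be checked numerically. The following is a minimal sketch in plain Python; the function names `dot` and `gram_schmidt` are illustrative and are not taken from the textbook, and the input vectors are assumed to be linearly independent.

```python
from math import sqrt

def dot(u, v):
    """Scalar product of two coordinate vectors."""
    return sum(x * y for x, y in zip(u, v))

def gram_schmidt(vectors):
    """Return an orthonormal list built from linearly independent vectors."""
    basis = []
    for v in vectors:
        t = list(v)
        # Subtract the projections of v onto the vectors found so far.
        for q in basis:
            coeff = dot(v, q)
            t = [ti - coeff * qi for ti, qi in zip(t, q)]
        norm = sqrt(dot(t, t))
        basis.append([ti / norm for ti in t])
    return basis

# Vectors a, b, c from Problem 5.3.
q1, q2, q3 = gram_schmidt([(1, 2, 3), (-3, -2, 3), (0, -2, -2)])
```

Up to rounding, `q3` should be proportional to $(3, -3, 1)$ and all pairwise scalar products of `q1`, `q2`, `q3` should vanish.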


5.4 Solution. 
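Write the matrix $A$ whose columns are the coordinates of the specified vectors:

\[
A = \begin{bmatrix} 3 & -3 & 0 & 1 \\ -1 & 1 & -2 & 4 \\ 1 & 0 & 2 & 2 \\ 2 & -2 & -2 & -7 \end{bmatrix}.
\]

Its determinant is equal to $\det A = -72 \neq 0$.

It is clear that $\operatorname{rk} A = 4$; then, according to the basic minor theorem, the vectors $a = (3, -1, 1, 2)$, $b = (-3, 1, 0, -2)$, $c = (0, -2, 2, -2)$ and $d = (1, 4, 2, -7)$ form a basis in the arithmetical space $\mathbb{R}^4$.

This basis is neither orthogonal nor orthonormal, because, for example, $(a \cdot b) = -14 \neq 0$ and $(a \cdot a) = 15 \neq 1$.

In order to construct an orthonormal basis $(q_1, q_2, q_3, q_4)$, apply the Gram-Schmidt algorithm:

\[
t_1 = a = (3, -1, 1, 2), \qquad q_1 = \frac{t_1}{\|t_1\|} = \frac{1}{\sqrt{15}}(3, -1, 1, 2);
\]

\[
t_2 = b - (b \cdot q_1) q_1 = b + \frac{14}{15} a = \frac{1}{15}(-3, 1, 14, -2), \qquad q_2 = \frac{t_2}{\|t_2\|} = \frac{1}{\sqrt{210}}(-3, 1, 14, -2);
\]

\[
t_3 = c - (c \cdot q_1) q_1 - (c \cdot q_2) q_2 = c - \frac{30}{210}(-3, 1, 14, -2) = \frac{3}{7}(1, -5, 0, -4), \qquad q_3 = \frac{t_3}{\|t_3\|} = \frac{1}{\sqrt{42}}(1, -5, 0, -4);
\]

\[
t_4 = d - (d \cdot q_1) q_1 - (d \cdot q_2) q_2 - (d \cdot q_3) q_3 = d + \frac{13}{15} a - \frac{43}{210}(-3, 1, 14, -2) - \frac{9}{42}(1, -5, 0, -4) = 4(1, 1, 0, -1), \qquad q_4 = \frac{t_4}{\|t_4\|} = \frac{1}{\sqrt{3}}(1, 1, 0, -1).
\]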
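5.5 Solution.
Construct a matrix $A$ of the coordinates of the specified vectors:

\[
A = \begin{bmatrix} i & 1 & 1 \\ 2 - i & 2 + i & i \\ 5 & -i & -1 \end{bmatrix}.
\]

Its determinant is $\det A = -8 - 6i \neq 0$, and therefore, the set of vectors forms a basis in the space $\mathbb{C}^3$.

Find the coordinates of the vectors $a = (1, 0, 0)$, $b = (1, 1, 0)$ and $c = (1, 1, 1)$ in this basis.

The vector $a$ in the basis $(e_1, e_2, e_3)$ has the coordinates $(a_1, a_2, a_3)$ that satisfy the system of equations, which in matrix notation has the form:

\[
\begin{bmatrix} i & 1 & 1 \\ 2 - i & 2 + i & i \\ 5 & -i & -1 \end{bmatrix}
\begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix} =
\begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix}.
\]

In order to solve the obtained system, let us use Cramer's rule, according to which, for $i \in \{1, 2, 3\}$, the equalities $a_i = \Delta_i / \Delta$ are valid:

\[
\Delta = -8 - 6i,
\]
\[
\Delta_1 = \begin{vmatrix} 1 & 1 & 1 \\ 0 & 2 + i & i \\ 0 & -i & -1 \end{vmatrix} = -3 - i, \quad
\Delta_2 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 0 & i \\ 5 & 0 & -1 \end{vmatrix} = 2 + 4i, \quad
\Delta_3 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 2 + i & 0 \\ 5 & -i & 0 \end{vmatrix} = -11 - 7i.
\]

Hence, we obtain the following coordinates: $a = (a_1, a_2, a_3)$, where $a_1 = \frac{1}{10}(3 - i)$, $a_2 = -\frac{1}{10}(4 + 2i)$ and $a_3 = \frac{1}{10}(13 - i)$.

Write the system of equations for the coordinates of the second vector $b$:

\[
\begin{bmatrix} i & 1 & 1 \\ 2 - i & 2 + i & i \\ 5 & -i & -1 \end{bmatrix}
\begin{bmatrix} b_1 \\ b_2 \\ b_3 \end{bmatrix} =
\begin{bmatrix} 1 \\ 1 \\ 0 \end{bmatrix},
\]
\[
\Delta_1 = \begin{vmatrix} 1 & 1 & 1 \\ 1 & 2 + i & i \\ 0 & -i & -1 \end{vmatrix} = -2 - 2i, \quad
\Delta_2 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 1 & i \\ 5 & 0 & -1 \end{vmatrix} = -3 + 3i, \quad
\Delta_3 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 2 + i & 1 \\ 5 & -i & 0 \end{vmatrix} = -7 - 7i.
\]

The values of the coordinates are $b_1 = \frac{1}{25}(7 + i)$, $b_2 = \frac{1}{50}(3 - 21i)$ and $b_3 = \frac{1}{50}(49 + 7i)$.

Finally, the coordinates of the vector $c$ satisfy the following system of equations in matrix form:

\[
\begin{bmatrix} i & 1 & 1 \\ 2 - i & 2 + i & i \\ 5 & -i & -1 \end{bmatrix}
\begin{bmatrix} c_1 \\ c_2 \\ c_3 \end{bmatrix} =
\begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix},
\]
\[
\Delta_1 = \begin{vmatrix} 1 & 1 & 1 \\ 1 & 2 + i & i \\ 1 & -i & -1 \end{vmatrix} = -4 - 2i, \quad
\Delta_2 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 1 & i \\ 5 & 1 & -1 \end{vmatrix} = 2i, \quad
\Delta_3 = \begin{vmatrix} i & 1 & 1 \\ 2 - i & 2 + i & 1 \\ 5 & -i & 1 \end{vmatrix} = -10 - 4i.
\]

The coordinates of the vector $c$ are equal to $c_1 = \frac{1}{25}(11 - 2i)$, $c_2 = -\frac{1}{25}(3 + 4i)$ and $c_3 = \frac{1}{25}(26 - 7i)$.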
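The Cramer's rule computations of Solution 5.5 can be reproduced with Python's built-in complex numbers. This is only a sketch: the helper names `det3` and `cramer3` are hypothetical, and the code assumes a non-singular 3x3 system.

```python
def det3(m):
    """Determinant of a 3x3 matrix (entries may be complex)."""
    (a, b, c), (d, e, f), (g, h, k) = m
    return a * (e * k - f * h) - b * (d * k - f * g) + c * (d * h - e * g)

def cramer3(m, rhs):
    """Solve m x = rhs by Cramer's rule; assumes det3(m) != 0."""
    delta = det3(m)
    solution = []
    for col in range(3):
        # Replace column `col` of m by the right-hand side.
        replaced = [[rhs[r] if c == col else m[r][c] for c in range(3)]
                    for r in range(3)]
        solution.append(det3(replaced) / delta)
    return solution

# Matrix of Problem 5.5 (1j is the imaginary unit in Python).
A = [[1j, 1, 1], [2 - 1j, 2 + 1j, 1j], [5, -1j, -1]]
coords_a = cramer3(A, [1, 0, 0])
coords_b = cramer3(A, [1, 1, 0])
coords_c = cramer3(A, [1, 1, 1])
```

Cramer's rule is used here only because the solution uses it; for larger systems Gaussian elimination is normally preferred.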


5.6 Solution. 

The criterion, i.e. the necessary and sufficient condition, of linear independence of the vectors $v_1$ and $v_2$ is that the determinant composed of their coordinates is not equal to zero:

\[
\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} \neq 0.
\]

As is known, transposition of a matrix does not change its determinant. Therefore, the inequality

\[
\begin{vmatrix} a_1 & a_2 \\ b_1 & b_2 \end{vmatrix} \neq 0
\]

holds, and the vectors $w_1 = [a_1, b_1]^T$ and $w_2 = [a_2, b_2]^T$ are linearly independent.


5.7 Proof. 

Each matrix from the set $M$ can be presented as a numerical sequence of length $p \times q$. Indeed, it is enough to write the matrix elements row by row into a vector, or, in other words, into a one-dimensional array of size $p \times q$. Since vectors of the same size form an arithmetical space, the set $M = \{M_{pq}\}$ also forms a vector space with respect to the operations of matrix addition and multiplication of a matrix by a number.
5.8 Proof.

Let us introduce for consideration the vector $z = x - \lambda y$, where $\lambda$ is some real number.

Based on the fourth property of the scalar product, $(z \cdot z) \geq 0$, or

\[
((x - \lambda y) \cdot (x - \lambda y)) = (x \cdot x) - 2\lambda (x \cdot y) + \lambda^2 (y \cdot y) \geq 0. \tag{5.61}
\]

This inequality should be valid for any $\lambda \in \mathbb{R}$. Note that the left side of the inequality (5.61) is a quadratic trinomial in $\lambda$. The necessary and sufficient condition of its non-negativity is non-positivity of the discriminant:

\[
(x \cdot y)^2 - (x \cdot x)(y \cdot y) \leq 0.
\]

After transferring the summand $(x \cdot x)(y \cdot y)$ to the right side, the obtained inequality coincides with the Cauchy-Bunyakovsky inequality.

The equals sign occurs if and only if $z = 0$ for some $\lambda$, i.e. $x$ and $y$ are proportional and differ by a scalar factor.


5.9 Proof. 
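From the definition of vector length and the properties of the scalar product, for orthogonal vectors $x$ and $y$, i.e. for $(x \cdot y) = 0$, the following equalities hold:

\[
\|x + y\|^2 = ((x + y) \cdot (x + y)) = (x \cdot x) + (x \cdot y) + (y \cdot x) + (y \cdot y) = \|x\|^2 + \|y\|^2.
\]

Thus, the Pythagorean theorem is proved for all orthogonal vectors $x, y \in \mathbb{R}^n$.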
5.10 Solution. 
Transforming the squares of norms in the left side of the identity, we obtain

\[
\|x + y\|^2 = ((x + y) \cdot (x + y)) = \|x\|^2 + 2(x \cdot y) + \|y\|^2, \tag{5.62}
\]
\[
\|x - y\|^2 = ((x - y) \cdot (x - y)) = \|x\|^2 - 2(x \cdot y) + \|y\|^2. \tag{5.63}
\]

Adding these relations gives the equality (5.56). Geometrically, this equality means that the sum of the squares of a parallelogram's diagonals is equal to the sum of the squares of its four sides.


5.11 Solution. 
Express the variable $\|y\|$, using the identity (5.56) from the previous problem:

\[
\|y\| = \sqrt{\left(\|x + y\|^2 + \|x - y\|^2\right)/2 - \|x\|^2}.
\]

Having substituted the numeric data, we obtain $\|y\| = \sqrt{(100 + 144)/2 - 36} = \sqrt{86}$.
5.12 Answer.


(1) The maximum number of linearly independent vectors equals to 2; 
(2) 3; 
(3) 1. 


5.13 Solution. 
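The matrix of the system of vectors under consideration has a lower triangular form:

\[
\begin{bmatrix}
1 & 0 & \ldots & 0 & 0 \\
1 & 1 & \ldots & 0 & 0 \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
1 & 1 & \ldots & 1 & 0 \\
1 & 1 & \ldots & 1 & 1
\end{bmatrix}.
\]

Its determinant is equal to the product of the elements located on the main diagonal, i.e. to 1, and is therefore non-zero. Hence, the vectors are linearly independent and form a basis in the vector space $\mathbb{R}^n$.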


5.14 Answer: It is. This is easily seen from the Vandermonde determinant being not 
equal to zero in this case (see (2.74) on page 67). 


5.15 Solution. 
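It is easy to verify that the equality is valid:

\[
\begin{bmatrix} 0 & a \\ 0 & 0 \end{bmatrix} = P^{-1} \begin{bmatrix} 0 & 0 \\ a & 0 \end{bmatrix} P,
\]

where $P = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}$. Therefore, by definition of the similarity relation (5.27), the matrices mentioned in the statement of the problem are similar.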


5.16 Answer:

(1) Yes, $A_2 = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}^{-1} A_1 \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}$;
(2) No, since the determinants of these matrices are not equal.
5.17 Solution. 

Yes. As follows from Problem 1.44, for any matrix $A$ and any invertible matrix $P$, the equality

\[
\operatorname{tr}(P^{-1} A P) = \operatorname{tr}(A P P^{-1}) = \operatorname{tr} A
\]

is fulfilled.
5.18 Solution.


(1) As is known, an eigenvector of the matrix $A$ is a non-zero vector $u$ such that multiplication of $A$ by $u$ results in the vector $\lambda u$, where $\lambda \in \mathbb{R}$ is the eigenvalue. Write an equation of the form $Au = \lambda u$:

\[
\begin{bmatrix} 4 & -5 & 2 \\ 5 & -7 & 3 \\ 6 & -9 & 4 \end{bmatrix}
\begin{bmatrix} u_1 \\ u_2 \\ u_3 \end{bmatrix} =
\lambda \begin{bmatrix} u_1 \\ u_2 \\ u_3 \end{bmatrix},
\]

or in the coordinates of the vector $u$:

\[
\begin{cases}
4u_1 - 5u_2 + 2u_3 = \lambda u_1, \\
5u_1 - 7u_2 + 3u_3 = \lambda u_2, \\
6u_1 - 9u_2 + 4u_3 = \lambda u_3,
\end{cases}
\qquad
\begin{cases}
(4 - \lambda)u_1 - 5u_2 + 2u_3 = 0, \\
5u_1 - (7 + \lambda)u_2 + 3u_3 = 0, \\
6u_1 - 9u_2 + (4 - \lambda)u_3 = 0.
\end{cases}
\]

Note that the eigenvector cannot be zero by definition: $[u_1, u_2, u_3]^T \neq [0, 0, 0]^T$.

Therefore, the equations are linearly dependent and the determinant of the system matrix is equal to zero:

\[
\begin{vmatrix} 4 - \lambda & -5 & 2 \\ 5 & -(7 + \lambda) & 3 \\ 6 & -9 & 4 - \lambda \end{vmatrix} = 0.
\]

Having computed the determinant, we obtain the characteristic equation: $\lambda^2 - \lambda^3 = 0$.

The eigenvalues are $\lambda_{1,2} = 0$ and $\lambda_3 = 1$, so zero is an eigenvalue of multiplicity two.
Find the respective eigenvectors.
Let $\lambda = 0$:

\[
\begin{cases}
4u_1 - 5u_2 + 2u_3 = 0, \\
5u_1 - 7u_2 + 3u_3 = 0, \\
6u_1 - 9u_2 + 4u_3 = 0.
\end{cases}
\]

Having solved this system, we obtain $u_2 = 2u_1$, $u_3 = 3u_1$, or $[u_1, u_2, u_3]^T = [u_1, 2u_1, 3u_1]^T$, $u_1 \in \mathbb{R}$.

The eigenvector is $[1, 2, 3]^T$.
Let $\lambda = 1$:

\[
\begin{cases}
3u_1 - 5u_2 + 2u_3 = 0, \\
5u_1 - 8u_2 + 3u_3 = 0, \\
6u_1 - 9u_2 + 3u_3 = 0.
\end{cases}
\]

Having solved this system, we obtain $u_1 = u_2$, $u_3 = u_2$, or $[u_1, u_2, u_3]^T = [u_2, u_2, u_2]^T$, $u_2 \in \mathbb{R}$. The eigenvector is $[1, 1, 1]^T$.

(2) By analogy with the solution from the previous item, write an equation of the form $Au = \lambda u$. Then, we obtain the characteristic equation:

\[
-(\lambda + 2)(\lambda - 1)^2 = 0.
\]

The eigenvalues are equal to $\lambda_1 = -2$ and $\lambda_{2,3} = 1$.
Let us find the eigenvectors.

For $\lambda = -2$,

\[
\begin{cases}
5u_1 + u_2 = 0, \\
-4u_1 + u_2 = 0, \\
4u_1 - 8u_2 = 0,
\end{cases}
\qquad u_1 = 0, \; u_2 = 0, \; u_3 \in \mathbb{R}.
\]

The eigenvector is $[0, 0, 1]^T$.

For $\lambda_{2,3} = 1$,

\[
\begin{cases}
2u_1 + u_2 = 0, \\
-4u_1 - 2u_2 = 0, \\
4u_1 - 8u_2 - 3u_3 = 0,
\end{cases}
\qquad u_2 = -2u_1, \; u_3 = \frac{20}{3} u_1,
\]

or $[u_1, u_2, u_3]^T = \left[u_1, -2u_1, \frac{20}{3}u_1\right]^T$, $u_1 \in \mathbb{R}$.

The eigenvector is $[3, -6, 20]^T$.

(3) The characteristic equation: $-\lambda^3 - 3\lambda^2 - 3\lambda - 1 = 0$.

The eigenvalue $\lambda = -1$ has multiplicity three:

\[
\begin{cases}
3u_1 - u_2 + 2u_3 = 0, \\
5u_1 - 2u_2 + 3u_3 = 0, \\
-u_1 - u_3 = 0.
\end{cases}
\]

Find non-trivial solutions: $u_1 = -u_3$ and $u_2 = -u_3$.
The eigenvector is $[-1, -1, 1]^T$.

(4) The characteristic equation has the form: $-(\lambda - 2)^3 = 0$.
The eigenvalue is $\lambda = 2$.
Find the eigenvectors for $\lambda = 2$:

\[
\begin{cases}
-2u_1 + u_2 = 0, \\
-4u_1 + 2u_2 = 0, \\
-2u_1 + u_2 = 0,
\end{cases}
\]

then $u = c_1 [1/2, 1, 0]^T + c_2 [0, 0, 1]^T$.
The eigenvectors: $[1, 2, 0]^T$, $[0, 0, 1]^T$.

(5) The characteristic equation: $-\lambda^3 + 3\lambda^2 - 3\lambda + 1 = 0$.
The eigenvalue $\lambda = 1$ has multiplicity three:

\[
\begin{cases}
-3u_2 + 3u_3 = 0, \\
-2u_1 - 7u_2 + 13u_3 = 0, \\
-u_1 - 4u_2 + 7u_3 = 0,
\end{cases}
\]

and therefore, $u_2 = u_3$ and $u_1 = 3u_3$, $u_3 \in \mathbb{R}$.

The eigenvector is $[3, 1, 1]^T$.

5.19 Solution. 
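Compose the characteristic equation:

\[
\begin{vmatrix} 4 - \lambda & 15 & -3 \\ 8 & -3 - \lambda & 3 \\ 0 & -15 & 7 - \lambda \end{vmatrix} = 0.
\]

Find the characteristic polynomial, expanding the determinant in the first column:

\[
p(\lambda) = (4 - \lambda)\bigl((-3 - \lambda)(7 - \lambda) + 45\bigr) - 8\bigl(15(7 - \lambda) - 45\bigr),
\]
\[
p(\lambda) = -\lambda^3 + 8\lambda^2 + 80\lambda - 384.
\]

Find the eigenvalues: $\lambda_1 = 12$, $\lambda_2 = -8$ and $\lambda_3 = 4$.
Compose the eigenvectors.

(1) For $\lambda_1$,

\[
\begin{cases}
-8x_1 + 15x_2 - 3x_3 = 0, \\
8x_1 - 15x_2 + 3x_3 = 0, \\
-15x_2 - 5x_3 = 0
\end{cases}
\Rightarrow a_1 = c(3, 1, -3).
\]

(2) For $\lambda_2$,

\[
\begin{cases}
12x_1 + 15x_2 - 3x_3 = 0, \\
8x_1 + 5x_2 + 3x_3 = 0, \\
-15x_2 + 15x_3 = 0
\end{cases}
\Rightarrow a_2 = c(-1, 1, 1).
\]

(3) For $\lambda_3$,

\[
\begin{cases}
15x_2 - 3x_3 = 0, \\
8x_1 - 7x_2 + 3x_3 = 0, \\
-15x_2 + 3x_3 = 0
\end{cases}
\Rightarrow a_3 = c(-1, 1, 5).
\]

Write the transformation matrix and the matrix inverse of it:

\[
P = \begin{bmatrix} -3 & -1 & -1 \\ -1 & 1 & 1 \\ 3 & 1 & 5 \end{bmatrix}, \qquad
P^{-1} = -\frac{1}{4} \begin{bmatrix} 1 & 1 & 0 \\ 2 & -3 & 1 \\ -1 & 0 & -1 \end{bmatrix}.
\]

As a result, the diagonalized matrix is equal to

\[
B = P^{-1} A P = \begin{bmatrix} 12 & 0 & 0 \\ 0 & -8 & 0 \\ 0 & 0 & 4 \end{bmatrix}, \quad \text{where } P = \begin{bmatrix} -3 & -1 & -1 \\ -1 & 1 & 1 \\ 3 & 1 & 5 \end{bmatrix}.
\]
5.20 Solution.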
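(1) The characteristic equation has the form:

\[
|A - \lambda I| = \begin{vmatrix} 4 - \lambda & 1 & 4 \\ 6 & 3 - \lambda & 6 \\ -11 & -5 & -11 - \lambda \end{vmatrix} = 0.
\]

Compute the characteristic polynomial:

\[
p(\lambda) = (4 - \lambda)\bigl((3 - \lambda)(-11 - \lambda) + 30\bigr) - \bigl(6(-11 - \lambda) + 66\bigr) + 4\bigl(-30 - (3 - \lambda) \cdot (-11)\bigr),
\]
\[
p(\lambda) = -\lambda^3 - 4\lambda^2 - 3\lambda.
\]

The eigenvalues are $\lambda_1 = -3$, $\lambda_2 = -1$ and $\lambda_3 = 0$.
Compose the eigenvectors.

For $\lambda_1$,

\[
\begin{cases}
7x_1 + x_2 + 4x_3 = 0, \\
6x_1 + 6x_2 + 6x_3 = 0, \\
-11x_1 - 5x_2 - 8x_3 = 0
\end{cases}
\Rightarrow a_1 = c(1, 1, -2).
\]

For $\lambda_2$,

\[
\begin{cases}
5x_1 + x_2 + 4x_3 = 0, \\
6x_1 + 4x_2 + 6x_3 = 0, \\
-11x_1 - 5x_2 - 10x_3 = 0
\end{cases}
\Rightarrow a_2 = c(5, 3, -7).
\]

For $\lambda_3$,

\[
\begin{cases}
4x_1 + x_2 + 4x_3 = 0, \\
6x_1 + 3x_2 + 6x_3 = 0, \\
-11x_1 - 5x_2 - 11x_3 = 0
\end{cases}
\Rightarrow a_3 = c(-1, 0, 1).
\]

The transformation matrix is $P = \begin{bmatrix} 1 & 5 & -1 \\ 1 & 3 & 0 \\ -2 & -7 & 1 \end{bmatrix}$, and the matrix inverse of it has the form $P^{-1} = \begin{bmatrix} -3 & -2 & -3 \\ 1 & 1 & 1 \\ 1 & 3 & 2 \end{bmatrix}$.

The diagonalized matrix:

\[
B = P^{-1} A P = \begin{bmatrix} -3 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 0 \end{bmatrix}, \quad \text{where } P = \begin{bmatrix} 1 & 5 & -1 \\ 1 & 3 & 0 \\ -2 & -7 & 1 \end{bmatrix}.
\]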


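(2) Write the characteristic equation:

\[
p(\lambda) = (-23 - \lambda)\bigl((39 - \lambda)(-10 - \lambda) + 64 \cdot 7\bigr) + 16\bigl(58(-10 - \lambda) + 11 \cdot 64\bigr) - 28\bigl(-7 \cdot 58 + 11(39 - \lambda)\bigr),
\]
\[
p(\lambda) = -\lambda^3 + 6\lambda^2 - 11\lambda + 6.
\]

The eigenvalues are $\lambda_1 = 3$, $\lambda_2 = 2$ and $\lambda_3 = 1$.
Compose the eigenvectors.

For $\lambda_1$,

\[
\begin{cases}
-26x_1 - 16x_2 - 28x_3 = 0, \\
58x_1 + 36x_2 + 64x_3 = 0, \\
-11x_1 - 7x_2 - 13x_3 = 0
\end{cases}
\Rightarrow a_1 = c(2, -5, 1).
\]

For $\lambda_2$,

\[
\begin{cases}
-25x_1 - 16x_2 - 28x_3 = 0, \\
58x_1 + 37x_2 + 64x_3 = 0, \\
-11x_1 - 7x_2 - 12x_3 = 0
\end{cases}
\Rightarrow a_2 = c(4, -8, 1).
\]

For $\lambda_3$,

\[
\begin{cases}
-24x_1 - 16x_2 - 28x_3 = 0, \\
58x_1 + 38x_2 + 64x_3 = 0, \\
-11x_1 - 7x_2 - 11x_3 = 0
\end{cases}
\Rightarrow a_3 = c(5, -11, 2).
\]

The transformation matrix is $P = \begin{bmatrix} 2 & 4 & 5 \\ -5 & -8 & -11 \\ 1 & 1 & 2 \end{bmatrix}$, and the matrix inverse of it is $P^{-1} = \begin{bmatrix} -5 & -3 & -4 \\ -1 & -1 & -3 \\ 3 & 2 & 4 \end{bmatrix}$.

Write the diagonalized matrix:

\[
B = P^{-1} A P = \begin{bmatrix} 3 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad \text{where } P = \begin{bmatrix} 2 & 4 & 5 \\ -5 & -8 & -11 \\ 1 & 1 & 2 \end{bmatrix}.
\]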

5.21 Answer: 

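The characteristic polynomial has the form $p(\lambda) = \lambda^3 - 3a\lambda^2 + 3a^2\lambda - \lambda + (a - a^3)$; its roots are equal to $\lambda_1 = a - 1$, $\lambda_2 = a$ and $\lambda_3 = a + 1$.

The diagonalized matrix is equal to

\[
B = P^{-1} A P = \begin{bmatrix} a + 1 & 0 & 0 \\ 0 & a & 0 \\ 0 & 0 & a - 1 \end{bmatrix}, \quad \text{where } P = \begin{bmatrix} -1 & 1 & 0 \\ 1 & -1 & -1 \\ 0 & 1 & 1 \end{bmatrix}.
\]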


5.22 Solution. 
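Write the left side of the characteristic equation $\det(\Omega - \lambda I) = 0$, expanding the determinant in the first column:

\[
\det(\Omega - \lambda I) =
\begin{vmatrix}
-\lambda & 1 & 0 & \ldots & 0 & 0 \\
0 & -\lambda & 1 & \ldots & 0 & 0 \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
0 & 0 & 0 & \ldots & -\lambda & 1 \\
\omega & 0 & 0 & \ldots & 0 & -\lambda
\end{vmatrix}
\]
\[
= -\lambda
\begin{vmatrix}
-\lambda & 1 & \ldots & 0 & 0 \\
0 & -\lambda & \ldots & 0 & 0 \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
0 & 0 & \ldots & -\lambda & 1 \\
0 & 0 & \ldots & 0 & -\lambda
\end{vmatrix}
+ (-1)^{n+1} \omega
\begin{vmatrix}
1 & 0 & \ldots & 0 & 0 \\
-\lambda & 1 & \ldots & 0 & 0 \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
0 & 0 & \ldots & 1 & 0 \\
0 & 0 & \ldots & -\lambda & 1
\end{vmatrix}
\]
\[
= (-\lambda)^n + (-1)^{n+1} \omega = (-1)^n (\lambda^n - \omega).
\]

As a result, the characteristic equation for the matrix $\Omega$ has the form $(-1)^n (\lambda^n - \omega) = 0$ or $\lambda^n - \omega = 0$.
5.23 Solution.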

According to the Cayley-Hamilton theorem, when a matrix is substituted into its characteristic equation, an identity is obtained. As is shown in Problem 5.22, the characteristic equation for $\Omega$ has the form $\lambda^n - \omega = 0$. Then, the equality is valid:

\[
\Omega^n - \omega I = O, \quad \text{or} \quad \Omega^n = \omega I,
\]

where, as usual, $O$ is the zero matrix of size $n \times n$ and $I$ is the identity matrix of the same size.


5.24 Solution. 

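Denote $A = \begin{bmatrix} 1 & \varphi/n \\ -\varphi/n & 1 \end{bmatrix}$.

The eigenvalues of this matrix are equal to $\lambda_{1,2} = 1 \pm i\varphi/n$; the eigenvectors corresponding to them are equal to $[1, i]^T$ and $[1, -i]^T$.

Compute the power $A^n$, having brought the matrix to diagonal form $A'$ first:

\[
A' = \begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix}^{-1}
\begin{bmatrix} 1 & \varphi/n \\ -\varphi/n & 1 \end{bmatrix}
\begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix} =
\begin{bmatrix} 1 + i\varphi/n & 0 \\ 0 & 1 - i\varphi/n \end{bmatrix},
\]

and therefore, according to the theorem on the power of a special form matrix on page 56,

\[
A^n = \begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix} (A')^n \begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix}^{-1}
= \begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix}
\begin{bmatrix} (1 + i\varphi/n)^n & 0 \\ 0 & (1 - i\varphi/n)^n \end{bmatrix}
\begin{bmatrix} 1 & 1 \\ i & -i \end{bmatrix}^{-1}.
\]

After simple computations, we obtain

\[
A^n = \frac{1}{2} \begin{bmatrix}
(1 + i\varphi/n)^n + (1 - i\varphi/n)^n & -i\bigl((1 + i\varphi/n)^n - (1 - i\varphi/n)^n\bigr) \\
i\bigl((1 + i\varphi/n)^n - (1 - i\varphi/n)^n\bigr) & (1 + i\varphi/n)^n + (1 - i\varphi/n)^n
\end{bmatrix}.
\]

Using the relation known from mathematical analysis [76]

\[
\lim_{n \to \infty} (1 + t/n)^n = e^t,
\]

which is valid for all $t \in \mathbb{C}$, perform the limit operation:

\[
\lim_{n \to \infty} A^n = \frac{1}{2} \begin{bmatrix}
e^{i\varphi} + e^{-i\varphi} & -i\bigl(e^{i\varphi} - e^{-i\varphi}\bigr) \\
i\bigl(e^{i\varphi} - e^{-i\varphi}\bigr) & e^{i\varphi} + e^{-i\varphi}
\end{bmatrix}.
\]

Representing the exponent of the imaginary number by Euler's formula $\cos z + i \sin z = e^{iz}$, we arrive at the final answer:

\[
\lim_{n \to \infty} \begin{bmatrix} 1 & \varphi/n \\ -\varphi/n & 1 \end{bmatrix}^n =
\begin{bmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{bmatrix}.
\]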
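The limit obtained in Solution 5.24 can be checked numerically. The following sketch raises the matrix to a large power by repeated squaring and compares it with the rotation-type matrix; the names `matmul2` and `power_of_A`, the value of `phi` and the choice $n = 10^6$ are illustrative assumptions.

```python
from math import cos, sin

def matmul2(x, y):
    """Product of two 2x2 matrices given as lists of rows."""
    return [[x[i][0] * y[0][j] + x[i][1] * y[1][j] for j in range(2)]
            for i in range(2)]

def power_of_A(phi, n):
    """Compute [[1, phi/n], [-phi/n, 1]]**n by repeated squaring."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    base = [[1.0, phi / n], [-phi / n, 1.0]]
    k = n
    while k > 0:
        if k & 1:
            result = matmul2(result, base)
        base = matmul2(base, base)
        k >>= 1
    return result

phi = 0.7
approx = power_of_A(phi, 10**6)
exact = [[cos(phi), sin(phi)], [-sin(phi), cos(phi)]]
```

For finite $n$ the two matrices differ by a term of order $\varphi^2/(2n)$, so the agreement improves only linearly in $1/n$.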
5.25 Answer: 
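\[
Z' = P^{-1} Z P = \begin{bmatrix} -2i & 0 & 0 \\ 0 & i & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad \text{where } P = \begin{bmatrix} -2 & -2i & -i \\ 0 & 1 - i & 0 \\ 1 & 2 & 1 \end{bmatrix}.
\]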


5.26 Solution. 

As is known, the traces of similar matrices coincide (see Problem 5.17). Therefore, the sum of the eigenvalues of the matrix is equal to the trace of this matrix:

\[
\operatorname{tr} Y = \lambda_1 + \lambda_2 + \lambda_3 = \sum_{i=1}^{3} y_{ii} = 910 + (68 - 1070i) + (68 + 1070i) = 1046,
\]

and the third eigenvalue is equal to

\[
\lambda_3 = 1046 - (\lambda_1 + \lambda_2) = 1046 - \bigl((11 + 57i) + (11 - 57i)\bigr) = 1024.
\]
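The trace argument of Solution 5.26 amounts to a single subtraction, as the following short sketch shows. It uses Python's built-in complex type; the variable names are illustrative.

```python
# Matrix Y of Problem 5.26 (1j is the imaginary unit in Python).
Y = [
    [910, 1013 + 57j, -1013 + 57j],
    [57 - 899j, 68 - 1070j, 57 + 1013j],
    [-57 - 899j, 57 - 1013j, 68 + 1070j],
]

# The trace equals the sum of all three eigenvalues.
trace = sum(Y[i][i] for i in range(3))

# Subtract the two known eigenvalues to get the third one.
third = trace - ((11 + 57j) + (11 - 57j))
```

Only the diagonal of $Y$ enters the computation; the off-diagonal elements are never used.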


5.27 Proof. 
Let $A$ be an arbitrary Hermitian matrix of size $n \times n$ that has the eigenvector $b$ corresponding to the eigenvalue $\lambda_0$. This means that the equality is fulfilled

\[
Ab = \lambda_0 b. \tag{5.64}
\]

Consider the expression $b^H A b$. It is equal to a real number, since according to the theorem on Hermitian conjugation of a product on page 181,

\[
(b^H A b)^H = b^H A^H (b^H)^H = b^H A b. \tag{5.65}
\]

According to (5.64), the equality is fulfilled

\[
b^H A b = \lambda_0 b^H b = \lambda_0 \left(|b_1|^2 + |b_2|^2 + \cdots + |b_n|^2\right), \tag{5.66}
\]

where $b_1, b_2, \ldots, b_n$ are the components of the vector $b$.

Comparing (5.65) and (5.66), and taking into account that the sum of squared moduli is a positive real number because $b \neq 0$, we obtain that $\lambda_0 \in \mathbb{R}$.


5.28 Proof. 
Let $U$ be an arbitrary unitary matrix that has the eigenvector $b$ corresponding to the eigenvalue $\lambda_0$. This means that the equality is fulfilled

\[
Ub = \lambda_0 b. \tag{5.67}
\]

Consider the expression $(Ub)^H (Ub)$. By virtue of (5.67), this expression can be presented in the form:

\[
(Ub)^H (Ub) = (\lambda_0 b)^H (\lambda_0 b) = (\lambda_0^* b^H)(\lambda_0 b) = \lambda_0^* \lambda_0 b^H b = |\lambda_0|^2 b^H b. \tag{5.68}
\]

Note that the transformations used property (3) of the Hermitian conjugate operation (see page 181).

On the other hand, relying on the theorem on Hermitian conjugation of a product on page 181 and on the property of unitarity $U^H U = I$, we obtain

\[
(Ub)^H (Ub) = (b^H U^H)(Ub) = b^H (U^H U) b = b^H I b = b^H b. \tag{5.69}
\]

Comparing (5.68) and (5.69), and using $b^H b > 0$, we come to the conclusion: $|\lambda_0|^2 = 1$.

Therefore, the complex number $\lambda_0$ is located in the complex plane at a distance of $\rho = 1$ from the origin of coordinates. The locus of all such points $\lambda_0$ is the unit circle with the centre at the origin of coordinates, which is what we set out to prove.


5.29 Answer: 


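for $\sigma_1$, $\lambda_{1,2} = \pm 1$, the eigenvectors are $b_{1,2} = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 \\ \pm 1 \end{bmatrix}$;

for $\sigma_2$, $\lambda_{1,2} = \pm 1$, the eigenvectors are $b_{1,2} = \frac{1}{\sqrt{2}} \begin{bmatrix} 1 \\ \pm i \end{bmatrix}$;

for $\sigma_3$, $\lambda_{1,2} = \pm 1$, the eigenvectors are $b_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}$ and $b_2 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}$.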
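The eigenpairs listed in Answer 5.29 can be verified by substitution into $\sigma b = \lambda b$. The sketch below assumes the standard definition of the Pauli matrices, which is expected to match the one given on page 185; the helper name `is_eigenpair` is hypothetical, and the vectors are left unnormalized because the factor $1/\sqrt{2}$ does not affect the check.

```python
# Standard Pauli matrices (assumed to coincide with the textbook's definition).
sigma1 = [[0, 1], [1, 0]]
sigma2 = [[0, -1j], [1j, 0]]
sigma3 = [[1, 0], [0, -1]]

def is_eigenpair(m, lam, v):
    """Return True if m v equals lam v for a 2x2 matrix m and a 2-vector v."""
    mv = [m[0][0] * v[0] + m[0][1] * v[1],
          m[1][0] * v[0] + m[1][1] * v[1]]
    return all(abs(mv[i] - lam * v[i]) < 1e-12 for i in range(2))

checks = [
    is_eigenpair(sigma1, 1, [1, 1]), is_eigenpair(sigma1, -1, [1, -1]),
    is_eigenpair(sigma2, 1, [1, 1j]), is_eigenpair(sigma2, -1, [1, -1j]),
    is_eigenpair(sigma3, 1, [1, 0]), is_eigenpair(sigma3, -1, [0, 1]),
]
```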


Chapter 6
Vectors in a Three-Dimensional Space


A geometrical vector is a directed segment $\overrightarrow{AB}$ with the beginning at the point $A$ and the end at the point $B$. Hereinafter, the word "geometrical" in this definition will be omitted for brevity.

The zero vector, or null vector, is a vector whose beginning and end coincide.

The length of the segment $AB$ is called the modulus or magnitude of the vector $\overrightarrow{AB}$ and is denoted by $|\overrightarrow{AB}|$.

Vectors lying on parallel lines are referred to as collinear.

A unit vector is a vector whose modulus is equal to one.

The vector $b$ is the product of the number $\alpha$ and the vector $a$, if the following conditions are met:

(1) $|b| = \operatorname{abs}(\alpha)|a|$;
(2) the directions of the vectors $a$ and $b$ coincide if $\alpha > 0$, and these vectors are oppositely directed if $\alpha < 0$.

The product of zero and $a$ is equal to the zero vector by definition.

Two vectors $\overrightarrow{AB}$ and $\overrightarrow{CD}$ are considered to be equal if they are collinear, have the same moduli and are unidirectional.

The vector

\[
\frac{\overrightarrow{AB}}{|\overrightarrow{AB}|} \tag{6.1}
\]

is called the normalized vector of the vector $\overrightarrow{AB}$. It is a unit vector whose direction coincides with that of the vector $\overrightarrow{AB}$.

When determining the sum of vectors, the law of parallelogram should be used (see Fig. 6.1).

Fig. 6.1 Determining the vector $c$ as the sum of the vectors $a$ and $b$ (law of parallelogram)


Let the line $L$ and the point $A$ be specified in space. Let us draw, through the point $A$, the plane $\pi$ that is orthogonal to the line $L$. The point of intersection of the plane $\pi$ and the line $L$ is called the projection of the point $A$ on the line $L$.

Consider the vector $\overrightarrow{AB}$ and the line $L$ in space. Let $A'$ and $B'$ be the projections of the points $A$ and $B$ on the line $L$, respectively. Then, the vector $\overrightarrow{A'B'}$ is called the projection of the vector $\overrightarrow{AB}$ on the line $L$ and is denoted by $\operatorname{Pr}_L \overrightarrow{AB}$.

The numeric projection of the vector $\overrightarrow{AB}$ on the line $L$ is equal to the modulus $|\overrightarrow{AB}|$, multiplied by the cosine of the angle $\alpha$ between the vector $\overrightarrow{AB}$ and the line $L$, i.e. $\operatorname{Pr}_L \overrightarrow{AB} = |\overrightarrow{AB}| \cos\alpha$.

Numeric projections of vectors have the following properties:

\[
\operatorname{Pr}_L (a + b) = \operatorname{Pr}_L a + \operatorname{Pr}_L b, \tag{6.2}
\]
\[
\operatorname{Pr}_L (\alpha a) = \alpha \operatorname{Pr}_L a. \tag{6.3}
\]


6.1 Cartesian Coordinate System 


A Cartesian¹ coordinate system is a system that consists of the reference point $O$, the mutually perpendicular axes $Ox$, $Oy$, $Oz$ that intersect at the point $O$, and a scale unit segment.

Let $A$ be an arbitrary point in space. The vector $\overrightarrow{OA}$ is called the position vector of the point $A$. The numeric projection of the vector $\overrightarrow{OA}$ on the axis $Ox$ is denoted by $x$ and is called the abscissa, on the axis $Oy$ by $y$ and is called the ordinate, and on the axis $Oz$ by $z$ and is called the applicate of the point $A$ [19, 23].

The respective coordinates of equal vectors coincide.

¹René Descartes (1596-1650), French philosopher, mathematician, physicist and physiologist.

Fig. 6.2 Cartesian coordinate system

The real numbers $x$, $y$, $z$ are the coordinates of the point $A$ and of the position vector $\overrightarrow{OA}$, which can be written as $A(x, y, z)$ and $\overrightarrow{OA} = (x, y, z)$.

The normalized vectors (basis vectors) of the Cartesian coordinate system are the unit vectors $i$, $j$, $k$ (see Fig. 6.2). In coordinate representation, they have the following form: $i = (1, 0, 0)$, $j = (0, 1, 0)$ and $k = (0, 0, 1)$.

An arbitrary vector $\overrightarrow{AB} = (x, y, z)$ can be represented in the form of an expansion in the basis of the Cartesian coordinate system:

\[
\overrightarrow{AB} = x i + y j + z k. \tag{6.4}
\]

The linear operations on vectors, i.e. addition of vectors and multiplication of a vector by a number, are performed component-wise or coordinate-wise. This means that if $a = (a_x, a_y, a_z)$, $b = (b_x, b_y, b_z)$ and $c \in \mathbb{R}$, then

\[
a + b = (a_x + b_x, a_y + b_y, a_z + b_z), \qquad ca = (ca_x, ca_y, ca_z). \tag{6.5}
\]

With the help of the normalized vectors of the coordinate system, the obtained equalities can be written as

\[
a + b = (a_x + b_x) i + (a_y + b_y) j + (a_z + b_z) k, \qquad ca = ca_x i + ca_y j + ca_z k. \tag{6.6}
\]

Let us illustrate the use of the introduced definitions by the following example.
Example 6.1 For the vectors $v_1 = (-2, -1, 7)$, $v_2 = (0, 4, -6)$ and the scalar $t = 3$, we have

\[
v_1 + v_2 = (-2 + 0, -1 + 4, 7 + (-6)) = (-2, 3, 1) = -2i + 3j + k,
\]
\[
t v_1 = (3 \cdot (-2), 3 \cdot (-1), 3 \cdot 7) = (-6, -3, 21) = -6i - 3j + 21k.
\]
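The component-wise rules (6.5) translate directly into code. The following minimal sketch represents vectors as tuples; the function names `add` and `scale` are illustrative.

```python
def add(a, b):
    """Component-wise sum of two vectors, as in (6.5)."""
    return tuple(x + y for x, y in zip(a, b))

def scale(c, a):
    """Component-wise product of the number c and the vector a, as in (6.5)."""
    return tuple(c * x for x in a)

# Data of Example 6.1.
v1, v2, t = (-2, -1, 7), (0, 4, -6), 3
total = add(v1, v2)
scaled = scale(t, v1)
```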
6.2 Scalar Product of Vectors


Recall (see page 223) that the scalar product of the two vectors $a = (x_a, y_a, z_a)$ and $b = (x_b, y_b, z_b)$ is denoted by $a \cdot b$ and determined through their coordinates as follows:

\[
a \cdot b = x_a x_b + y_a y_b + z_a z_b. \tag{6.7}
\]

It is easy to verify that for scalar products of the basis vectors the following equalities are valid:

\[
i \cdot j = i \cdot k = j \cdot k = 0 \quad \text{and} \quad i \cdot i = j \cdot j = k \cdot k = 1. \tag{6.8}
\]

Theorem 6.1 The scalar product of the two vectors $a$ and $b$ is equal to the product of the moduli of these vectors by the cosine of the angle $\alpha$ between them.

The concepts of projection of a vector on a line and scalar product of vectors are closely connected. Indeed, since the projection of the vector $a$ on the line containing the vector $b$ is equal to $\operatorname{Pr}_b a = |a| \cos\alpha$ and, on the other hand, $\operatorname{Pr}_a b = |b| \cos\alpha$, we can write

\[
a \cdot b = |a||b| \cos\alpha = |a| \operatorname{Pr}_a b = |b| \operatorname{Pr}_b a. \tag{6.9}
\]

If $a \cdot b = 0$, but $a \neq 0$ and $b \neq 0$, then such vectors are referred to as orthogonal, since the angle between them is equal to $\pi/2$. Recall (see page 224) that the notation of the form $a \perp b$ is used to denote orthogonality of vectors.


Example 6.2 Compute the scalar product of the vectors $a = (3, 2, 1)$ and $b = (0, 2, 1)$.

Solution

\[
a \cdot b = 3 \cdot 0 + 2 \cdot 2 + 1 \cdot 1 = 5. \tag{6.10}
\]

The length $|a|$ of the vector $a = (x_a, y_a, z_a)$ is computed by the formula:

\[
|a| = \sqrt{x_a^2 + y_a^2 + z_a^2}. \tag{6.11}
\]

Example 6.3 Find the length of the vector $a = (5, -3, -1)$.
Solution The length $|a|$ is equal to $\sqrt{5^2 + (-3)^2 + (-1)^2} = \sqrt{25 + 9 + 1} = \sqrt{35}$. □

The distance between the points $A(x_a, y_a, z_a)$ and $B(x_b, y_b, z_b)$ is computed by the formula:

\[
AB = \sqrt{(x_a - x_b)^2 + (y_a - y_b)^2 + (z_a - z_b)^2}. \tag{6.12}
\]

Example 6.4 Find the distance between the points $A(5, 3, 2)$ and $B(0, 3, 2)$.

Solution

\[
AB = \sqrt{(5 - 0)^2 + (3 - 3)^2 + (2 - 2)^2} = 5. \tag{6.13}
\]
□


Find the angle between two arbitrary vectors of a three-dimensional vector space. Let the vectors $a = (x_a, y_a, z_a)$ and $b = (x_b, y_b, z_b)$ be given. Represent the scalar product $a \cdot b$ in two ways, namely by the formulae (6.7) and (6.9):

\[
a \cdot b = x_a x_b + y_a y_b + z_a z_b; \tag{6.14}
\]
\[
a \cdot b = |a||b| \cos\alpha. \tag{6.15}
\]

Hence, we can conclude that the cosine of the angle $\alpha$ is equal to

\[
\cos\alpha = \frac{x_a x_b + y_a y_b + z_a z_b}{\sqrt{x_a^2 + y_a^2 + z_a^2} \sqrt{x_b^2 + y_b^2 + z_b^2}}. \tag{6.16}
\]

It is clear that

\[
\alpha = \arccos\left( \frac{x_a x_b + y_a y_b + z_a z_b}{\sqrt{x_a^2 + y_a^2 + z_a^2} \sqrt{x_b^2 + y_b^2 + z_b^2}} \right). \tag{6.17}
\]

Example 6.5 Find the angle between the vectors $a = (1, 2, 3)$ and $b = (0, 2, 1)$.

Solution Using the formula (6.17), we obtain

\[
\alpha = \arccos\left( \frac{1 \cdot 0 + 2 \cdot 2 + 3 \cdot 1}{\sqrt{1 + 4 + 9} \cdot \sqrt{0 + 4 + 1}} \right) = \arccos\frac{7}{\sqrt{70}}. \tag{6.18}
\]
□

Example 6.6 Let two vectors be given: $a = (5, 4, 1)$ and $b = (2, -2, -2)$. Are these vectors collinear or mutually orthogonal?

Solution If the vectors are collinear, then there exists a real number $\lambda$ such that the condition $a = \lambda b$ is satisfied. Hence, it follows:

\[
\frac{x_a}{x_b} = \frac{y_a}{y_b} = \frac{z_a}{z_b}. \tag{6.19}
\]

But since $\frac{5}{2} \neq \frac{4}{-2}$ and $\frac{5}{2} \neq \frac{1}{-2}$, the vectors $a$ and $b$ are non-collinear.

In order to check the mutual orthogonality of the vectors, find the scalar product $a \cdot b$:

\[
a \cdot b = 5 \cdot 2 + 4 \cdot (-2) + 1 \cdot (-2) = 0. \tag{6.20}
\]

Then, the vectors $a$ and $b$ are mutually orthogonal. □


6.3 Vector Product of Vectors 


A vector product, or cross product, of two vectors specified in the Cartesian coordinate system as $a = (x_a, y_a, z_a)$ and $b = (x_b, y_b, z_b)$ is a vector denoted by $a \times b$, or $[a, b]$, and determined according to the rule:

\[
a \times b = \begin{vmatrix} i & j & k \\ x_a & y_a & z_a \\ x_b & y_b & z_b \end{vmatrix}
= (y_a z_b - z_a y_b) i + (z_a x_b - x_a z_b) j + (x_a y_b - x_b y_a) k. \tag{6.21}
\]

6.3.1 Properties of the Vector Product 


For arbitrary vectors of the three-dimensional vector space $\mathbb{R}^3$, the following properties are valid.

Property 1

\[
|a \times b| = |a||b| \sin\alpha, \tag{6.22}
\]

where $\alpha$ is the angle between $a$ and $b$.

Proof Prove that the equality is valid:

\[
|a \times b|^2 = |a|^2 |b|^2 \sin^2\alpha. \tag{6.23}
\]

Express $|a \times b|^2$ in terms of the coordinates of the initial vectors:

\[
|a \times b|^2 = (y_a z_b - z_a y_b)^2 + (z_a x_b - x_a z_b)^2 + (x_a y_b - x_b y_a)^2
\]
\[
= y_a^2 z_b^2 + z_a^2 y_b^2 + z_a^2 x_b^2 + x_a^2 z_b^2 + x_a^2 y_b^2 + x_b^2 y_a^2 - 2 y_a z_b z_a y_b - 2 z_a x_b x_a z_b - 2 x_a y_b x_b y_a.
\]

On the other hand, the right side of the equality (6.23) can be represented in the form:

\[
|a|^2 |b|^2 \sin^2\alpha = |a|^2 |b|^2 (1 - \cos^2\alpha) = (x_a^2 + y_a^2 + z_a^2)(x_b^2 + y_b^2 + z_b^2) - (x_a x_b + y_a y_b + z_a z_b)^2
\]
\[
= x_a^2 x_b^2 + x_a^2 y_b^2 + x_a^2 z_b^2 + y_a^2 x_b^2 + y_a^2 y_b^2 + y_a^2 z_b^2 + z_a^2 x_b^2 + z_a^2 y_b^2 + z_a^2 z_b^2
\]
\[
- x_a^2 x_b^2 - y_a^2 y_b^2 - z_a^2 z_b^2 - 2 x_a x_b y_a y_b - 2 x_a x_b z_a z_b - 2 y_a y_b z_a z_b
\]
\[
= y_a^2 z_b^2 + z_a^2 y_b^2 + z_a^2 x_b^2 + x_a^2 z_b^2 + x_a^2 y_b^2 + x_b^2 y_a^2 - 2 y_a z_b z_a y_b - 2 z_a x_b x_a z_b - 2 x_a y_b x_b y_a. \tag{6.24}
\]

From the relation $|a \times b|^2 = |a|^2 |b|^2 \sin^2\alpha$, since both sides of (6.22) are non-negative, we finally arrive at the conclusion that $|a \times b| = |a||b| \sin\alpha$.

Hence, it follows: if $a$ and $b$ are collinear, then $a \times b = 0$ (since $\sin 0 = 0$ and $\sin\pi = 0$).


Property 2 If the vectors $a$ and $b$ are non-collinear, then the vector $c = a \times b$ is orthogonal to each of the vectors $a$ and $b$.

Proof Expand the scalar product of the vectors with regard to their coordinates:

\[
(a \times b) \cdot a = y_a z_b x_a - z_a y_b x_a - x_a z_b y_a + z_a x_b y_a + x_a y_b z_a - y_a x_b z_a = 0. \tag{6.25}
\]

Similar to formula (6.25), we obtain that $(a \times b) \cdot b = 0$.
Therefore, the vector $c$ is orthogonal to both the vector $a$ and the vector $b$.

Property 3

\[
a \times b = -b \times a. \tag{6.26}
\]

Property 4

\[
a \times (b + c) = a \times b + a \times c, \tag{6.27}
\]
\[
(a + b) \times c = a \times c + b \times c. \tag{6.28}
\]

Fig. 6.3 To the computation of the area of the triangle $ABC$ (Example 6.8)


Example 6.7 The vectors $a = (3, -1, -2)$ and $b = (1, 2, -1)$ are given. Find the coordinates of the vector $(2a - b) \times (2a + b)$.

Solution Determine the coordinates of the new vectors: $2a - b = (5, -4, -3)$ and $2a + b = (7, 0, -5)$. Then, the sought-for vector will be equal to

\[
\begin{vmatrix} i & j & k \\ 5 & -4 & -3 \\ 7 & 0 & -5 \end{vmatrix} = 20i + 4j + 28k. \tag{6.29}
\]

We obtain the answer: $(20, 4, 28)$. □


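Example 6.8 The points $A(1, 2, 0)$, $B(3, 0, -3)$ and $C(5, 2, 6)$ are given. Compute the area $S_{\triangle ABC}$ of the triangle $ABC$ (see Fig. 6.3).

Solution The area of the triangle $ABC$ is half of the area of the parallelogram formed by the vectors $\overrightarrow{AB} = (2, -2, -3)$ and $\overrightarrow{AC} = (4, 0, 6)$. Therefore,

\[
S_{\triangle ABC} = \frac{1}{2} S_{ABDC} = \frac{1}{2} |\overrightarrow{AB} \times \overrightarrow{AC}|. \tag{6.30}
\]

Then,

\[
\overrightarrow{AB} \times \overrightarrow{AC} = \begin{vmatrix} i & j & k \\ 2 & -2 & -3 \\ 4 & 0 & 6 \end{vmatrix} = -12i - 24j + 8k. \tag{6.31}
\]

Hence,

\[
S_{\triangle ABC} = \frac{1}{2} \sqrt{12^2 + 24^2 + 8^2} = 14. \tag{6.32}
\]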
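The rule (6.21) and the area formula (6.30) can be sketched in a few lines of Python. The function names `cross` and `triangle_area` are illustrative; points and vectors are represented as 3-tuples.

```python
from math import sqrt

def cross(a, b):
    """Vector product of two 3-vectors according to (6.21)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def triangle_area(p, q, r):
    """Area of the triangle with vertices p, q, r according to (6.30)."""
    pq = tuple(x - y for x, y in zip(q, p))
    pr = tuple(x - y for x, y in zip(r, p))
    n = cross(pq, pr)
    return 0.5 * sqrt(sum(x * x for x in n))

area = triangle_area((1, 2, 0), (3, 0, -3), (5, 2, 6))  # data of Example 6.8
```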
6.4 Scalar Triple Product


The scalar triple product, or mixed product, of the vectors $a$, $b$ and $c$ is the number $(a, b, c)$, resulting from computation of the expression $(a \times b) \cdot c$. First, the vectors $a$ and $b$ are multiplied vectorially, and then, the resulting vector is multiplied by $c$ scalarly.

It is easy to verify the formula that expresses the scalar triple product of vectors in terms of their coordinates:

\[
(a, b, c) = ((a \times b) \cdot c) = (y_a z_b - z_a y_b) x_c + (z_a x_b - x_a z_b) y_c + (x_a y_b - y_a x_b) z_c
= \begin{vmatrix} x_a & y_a & z_a \\ x_b & y_b & z_b \\ x_c & y_c & z_c \end{vmatrix}. \tag{6.33}
\]

The notation $(abc)$ is also used for the scalar triple product $(a \times b) \cdot c$.


6.4.1 Properties of Scalar Triple Product 


Property 1

\[
(a \times b) \cdot c = a \cdot (b \times c) = (c \times a) \cdot b. \tag{6.34}
\]

Property 2

\[
\operatorname{abs}((a \times b) \cdot c) = V_p, \tag{6.35}
\]

where $V_p$ is the volume of the parallelepiped formed by the vectors $a$, $b$ and $c$ (see Fig. 6.4).

Fig. 6.4 To the computation of the volume of the parallelepiped

Proof By the definition of the numeric projection of the vector $c$ on the line specified by the vector $a \times b$, the equality is valid:

\[
(a \times b) \cdot c = |a \times b| \operatorname{Pr}_{a \times b} c, \tag{6.36}
\]

where $|a \times b| = S$ is the area of the parallelogram lying in the base of the parallelepiped.

The vector $a \times b$ is orthogonal to the base of the parallelepiped, and, therefore, the absolute value of $\operatorname{Pr}_{a \times b} c$ coincides with the height $h$ of the parallelepiped.

Thus,

\[
\operatorname{abs}((a \times b) \cdot c) = S \cdot h, \tag{6.37}
\]

as we set out to prove.

Note Since the tetrahedron $ABCA'$ forms one-sixth part of the volume of the parallelepiped, its volume is equal to

\[
V_{ABCA'} = \frac{1}{6} V_p = \frac{1}{6} \operatorname{abs}((a \times b) \cdot c). \tag{6.38}
\]

Three vectors are called coplanar if all of them are parallel to the same plane.
Property 3 The vectors $a$, $b$ and $c$ are coplanar if and only if $(a, b, c) = 0$.


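Example 6.9 Let the vertices of the tetrahedron be given: $A(2, -1, 1)$, $B(5, 5, 4)$, $C(3, 2, -1)$ and $D(4, 1, 3)$. Compute its volume.

Solution Let us make the vectors $\overrightarrow{AB} = (3, 6, 3)$, $\overrightarrow{AC} = (1, 3, -2)$ and $\overrightarrow{AD} = (2, 2, 2)$. Then, the volume of the tetrahedron $ABCD$ is equal to

\[
V_{ABCD} = \frac{1}{6} \operatorname{abs}\bigl((\overrightarrow{AB} \times \overrightarrow{AC}) \cdot \overrightarrow{AD}\bigr) = \frac{1}{6} \operatorname{abs} \det \begin{bmatrix} 3 & 6 & 3 \\ 1 & 3 & -2 \\ 2 & 2 & 2 \end{bmatrix} \tag{6.39}
\]
\[
= \frac{1}{6} \cdot 3 \cdot 2 \cdot \operatorname{abs} \det \begin{bmatrix} 1 & 2 & 1 \\ 1 & 3 & -2 \\ 1 & 1 & 1 \end{bmatrix} = \operatorname{abs}(-3) = 3. \tag{6.40}
\]

Example 6.10 Find whether the following vectors are coplanar: $a = (2, -1, 2)$, $b = (1, 2, -3)$ and $c = (3, -4, 7)$.

Solution Find the scalar triple product of the given vectors:

\[
(a, b, c) = \begin{vmatrix} 2 & -1 & 2 \\ 1 & 2 & -3 \\ 3 & -4 & 7 \end{vmatrix} = 2 \cdot (14 - 12) + 1 \cdot (7 + 9) + 2 \cdot (-4 - 6) = 0. \tag{6.41}
\]

Therefore, the vectors $a$, $b$ and $c$ are coplanar.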
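Formula (6.33), the volume formula (6.38) and the coplanarity criterion of Property 3 can be sketched as follows. The function names `triple`, `tetrahedron_volume` and `coplanar` are illustrative; the exact comparison with zero in `coplanar` is an assumption that suits integer coordinates, while floating-point input would need a tolerance.

```python
def triple(a, b, c):
    """Scalar triple product (a x b) . c written as the determinant (6.33)."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def tetrahedron_volume(p, q, r, s):
    """Volume of the tetrahedron with vertices p, q, r, s, as in (6.38)."""
    pq = tuple(x - y for x, y in zip(q, p))
    pr = tuple(x - y for x, y in zip(r, p))
    ps = tuple(x - y for x, y in zip(s, p))
    return abs(triple(pq, pr, ps)) / 6

def coplanar(a, b, c):
    """Coplanarity test of Property 3 (exact for integer coordinates)."""
    return triple(a, b, c) == 0

volume = tetrahedron_volume((2, -1, 1), (5, 5, 4), (3, 2, -1), (4, 1, 3))
```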


6.5 Vector Triple Product 


Having three vectors, for example, $a$, $b$ and $c$, apply the vector product operation first to $b$ and $c$, and then vectorially multiply $a$ and $b \times c$. As a result, we obtain the vector triple product $a \times (b \times c)$.

Theorem 6.2 For arbitrary vectors $a$, $b$ and $c$ of a three-dimensional vector space, the identity is valid:

\[
a \times (b \times c) = b(a \cdot c) - c(a \cdot b). \tag{6.42}
\]

For proof see Problem 6.12.
Note The relation (6.42) is also referred to as Lagrange's identity.

Consequence. For the operation of vector triple product, the Jacobi identity is valid:

\[
a \times (b \times c) + c \times (a \times b) + b \times (c \times a) = 0. \tag{6.43}
\]

Proof For each of the three summands of the sum, use the expansion (6.42):

\[
a \times (b \times c) = b(a \cdot c) - c(a \cdot b), \tag{6.44}
\]
\[
b \times (c \times a) = c(b \cdot a) - a(b \cdot c), \tag{6.45}
\]
\[
c \times (a \times b) = a(c \cdot b) - b(c \cdot a). \tag{6.46}
\]

Computation of the sum of the three vector products (6.44)-(6.46) after collecting similar summands results in zero. The consequence is proved.
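The identities (6.42) and (6.43) can be spot-checked on concrete vectors. The sketch below uses integer coordinates so that all comparisons are exact; the sample vectors and helper names are arbitrary illustrative choices, and such a check does not replace the proof.

```python
def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def scale(c, a):
    return tuple(c * x for x in a)

a, b, c = (1, 2, 3), (-2, 0, 5), (4, -1, 1)

# Left and right sides of (6.42): a x (b x c) = b(a.c) - c(a.b).
lhs = cross(a, cross(b, c))
rhs = add(scale(dot(a, c), b), scale(-dot(a, b), c))

# Left side of the Jacobi identity (6.43).
jacobi = add(add(cross(a, cross(b, c)), cross(c, cross(a, b))),
             cross(b, cross(c, a)))
```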
Review Questions


1. Define geometric vector.
2. What are zero vector and unit vector?
3. What vectors are called collinear?
4. What is the condition of equality of vectors?
5. Formulate the rule of parallelogram for addition of vectors.
6. How is the projection of the vector b onto the vector a determined?
7. What is the Cartesian coordinate system?
8. Enumerate the normalized vectors of the Cartesian coordinate system. How can one use them to expand an arbitrary vector in the Cartesian basis?
9. Define scalar product of vectors.
10. What vectors are called orthogonal?
11. How do we find the distance between two points specified by their Cartesian coordinates?
12. Define vector product of vectors.
13. Enumerate the properties of a vector product.
14. Define scalar triple product of vectors.
15. Enumerate the properties of a scalar triple product.
16. What three vectors are called coplanar?
17. What is vector triple product?
18. Write Lagrange's identity for the vector triple product.
19. Write the Jacobi identity.


Problems 


6.1. Compute the scalar and vector products of the vectors $c_1 = 2a - b$ and $c_2 = -a + 3b$, if:


(a) a= (-2,1,1),b= G, -2,4); 
(b) a= (2,1, —2), b = (—1,0, —2). 


6.2. There are given vertices of the triangle ABC. Compute its area and the 
cosine of the inner angle at the vertex B: 


(a) A(2, 1,0), B(3, 0, 3), C(2, —3, 7); 
(b) A(4, —3, 2), B(-1, 4, 3), C(, 3, —2). 


6.3. Find whether vectors a, b, c are coplanar: 


(a) a= (1,1,1),b = 2,3,0),¢ = 3,-1,-D; 
(b) a = (—1,0, —2), b = (0,0, —1), e = (—1, 0, 3). 


6.4. Prove that the points A(1, —1, 1), BC, 3, 1), C(4, 3, 1) and D(4, —1, 1) are 
vertices of a rectangle. Compute the length of its diagonals. 
6.5. Compute the coordinates of the vector c that is orthogonal to the vectors $a = 2j - k$ and $b = -i + 2j - 3k$ and forms an obtuse angle with the axis Oy, if $|c| = \sqrt{7}$.

6.6. Find the angle between the vectors a + b and a - b, if $a = 3i - j + 2k$ and $b = i + j - k$.

6.7. The vectors a and b form the angle $\pi/3$. Find the length of the vector a - 2b, if |a| = 2, |b| = 1.

6.8. For what value of the real parameter d are the vectors a = (12, 2, d) and b = (-3, 17d, -1) orthogonal?

6.9. For what value of the real parameter x will the vectors of the three-dimensional Euclidean vector space $t_1 = a - 10b$ and $t_2 = a + xb$ be orthogonal, if |a| = 5, |b| = 3, and the angle $\varphi$ between the vectors a and b is equal to ...?

6.10. For what value of the real parameter x will the vectors of the three-dimensional Euclidean vector space $t_1 = 2a + xb$ and $t_2 = b - 2a$ be orthogonal, if |a| = 1, |b| = 3/2, and the angle $\varphi$ between the vectors a and b is equal to ...?

6.11. The vectors a and b have, in the Cartesian basis, the coordinates $a = (a_1, a_2, 0)$ and $b = (b_1, b_2, 0)$. Find the sine of the angle between these vectors.

6.12. Prove the theorem on the vector triple product (6.42).

6.13. Prove the identities valid for arbitrary vectors a, b, c and d:

(1) $(a \times b) \cdot (c \times d) = (a \cdot c)(b \cdot d) - (a \cdot d)(b \cdot c)$;
(2) $(a \times b) \times (c \times d) = c\,(abd) - d\,(abc)$;
(3) $a \times (b \times (c \times d)) = (b \cdot d)(a \times c) - (b \cdot c)(a \times d)$;
(4) $\big(((a \times b) \times (b \times c))\,((b \times c) \times (c \times a))\,((c \times a) \times (a \times b))\big) = (abc)^4$.

6.14. Prove the identity valid for arbitrary vectors a, b, c, d, e, f:

$$(a \times b) \cdot ((c \times d) \times (e \times f)) = (abd)(cef) - (abc)(def).$$

*6.15. Simplify the vector expression that depends on the natural parameter n:

$$f_n = \underbrace{(a \times \cdots \times (a \times (a \times b)) \ldots)}_{n \text{ products}}.$$

6.16. Prove that for $p_1, p_2, p_3, p_4, p_5 \in \mathbb{R}^3$ the following equality holds:

$$(p_1 p_2 p_3)(p_1 p_4 p_5) = \begin{vmatrix} (p_1 p_2 p_4) & (p_1 p_2 p_5) \\ (p_1 p_3 p_4) & (p_1 p_3 p_5) \end{vmatrix}.$$
*6.17. Solve the system of linear equations, represented in vector form, for the unknown variables $x_1, x_2, x_3$:

$$\alpha \cdot x = \gamma,$$
$$\alpha \times x + \beta = 0,$$

where $x = (x_1, x_2, x_3)$, and the vectors $\alpha \ne 0$, $\beta$ and the scalar $\gamma$ do not depend on $x_1, x_2, x_3$.

*6.18. Solve the system of linear equations, represented in vector form, for the unknown variables $x_1, x_2, x_3$:

$$\alpha \cdot x = c_1,$$
$$\beta \cdot x = c_2,$$
$$\gamma \cdot x = c_3,$$

where $x = (x_1, x_2, x_3)$, the vectors $\alpha$, $\beta$, $\gamma$ and the constants $c_1, c_2, c_3$ do not depend on $x_1, x_2, x_3$, and $(\alpha, \beta, \gamma) \ne 0$.

*6.19. Solve the system of equations for the unknown vectors x and y:

$$\mu \times x + \rho \times y = \sigma,$$
$$\rho \times x - \mu \times y = \tau,$$

where $\mu, \rho, \sigma, \tau \in \mathbb{R}^3$, and the vectors $\mu$ and $\rho$ are not both equal to the zero vector.


Answers and Solutions 


6.1 Solution.

(a) Write the vectors $c_1$ and $c_2$ in coordinate form:

$$c_1 = 2(-2, 1, 1) - (3, -2, 4) = (-7, 4, -2), \qquad c_2 = -(-2, 1, 1) + 3(3, -2, 4) = (11, -7, 11).$$

The scalar product is equal to $c_1 \cdot c_2 = -7 \cdot 11 + 4 \cdot (-7) - 2 \cdot 11 = -127$.

The vector product is equal to

$$c_1 \times c_2 = \begin{vmatrix} i & j & k \\ -7 & 4 & -2 \\ 11 & -7 & 11 \end{vmatrix} = 30i + 55j + 5k = (30, 55, 5).$$

(b) Write the vectors $c_1$ and $c_2$: $c_1 = (5, 2, -2)$, $c_2 = (-5, -1, -4)$.

Their scalar product is equal to $c_1 \cdot c_2 = 5 \cdot (-5) + 2 \cdot (-1) - 2 \cdot (-4) = -19$.

The vector product:

$$c_1 \times c_2 = \begin{vmatrix} i & j & k \\ 5 & 2 & -2 \\ -5 & -1 & -4 \end{vmatrix} = -10i + 30j + 5k = (-10, 30, 5).$$


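6.2 Solution.

(a) Compute the coordinates of the vectors $\overrightarrow{BA}$ and $\overrightarrow{BC}$:

$$\overrightarrow{BA} = (2 - 3,\ 1 - 0,\ 0 - 3) = (-1, 1, -3),$$
$$\overrightarrow{BC} = (2 - 3,\ -3 - 0,\ 7 - 3) = (-1, -3, 4).$$

Then, find the cosine of the angle $\varphi$ at the vertex B:

$$\cos\varphi = \frac{\overrightarrow{BA} \cdot \overrightarrow{BC}}{|\overrightarrow{BA}| \cdot |\overrightarrow{BC}|} = \frac{(-1)(-1) + 1 \cdot (-3) + (-3) \cdot 4}{\sqrt{(-1)^2 + 1^2 + (-3)^2}\,\sqrt{(-1)^2 + (-3)^2 + 4^2}} = -\frac{14}{\sqrt{286}}.$$

The area can be computed in two ways.

The first method

The vector product of $\overrightarrow{BA}$ and $\overrightarrow{BC}$ is determined as

$$\overrightarrow{BA} \times \overrightarrow{BC} = \begin{vmatrix} i & j & k \\ -1 & 1 & -3 \\ -1 & -3 & 4 \end{vmatrix} = (4 - 9)i - (-4 - 3)j + (3 + 1)k = -5i + 7j + 4k.$$

Substitute the obtained values of the coordinates into the formula for the area of the triangle:

$$S = \frac{1}{2}\,|\overrightarrow{BA} \times \overrightarrow{BC}| = \frac{1}{2}\sqrt{5^2 + 7^2 + 4^2} = \frac{3}{2}\sqrt{10}.$$

The second method

According to the fundamental trigonometric identity (see Appendix B, formula (B.1)), we have

$$\sin\varphi = \sqrt{1 - \cos^2\varphi} = \sqrt{1 - \frac{14^2}{286}} = \frac{3\sqrt{10}}{\sqrt{286}}.$$

Substitute the coordinate values into the area formula:

$$S = \frac{1}{2}\,|\overrightarrow{BA}| \cdot |\overrightarrow{BC}| \cdot \sin\varphi = \frac{1}{2}\sqrt{(-1)^2 + 1^2 + (-3)^2}\,\sqrt{(-1)^2 + (-3)^2 + 4^2} \cdot \frac{3\sqrt{10}}{\sqrt{286}} = \frac{3}{2}\sqrt{10}.$$

Both methods of computing the area of the triangle give the same answer:

$$\cos\varphi = -\frac{14}{\sqrt{286}}, \qquad S = \frac{3}{2}\sqrt{10}.$$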


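(b) The coordinates of the vectors $\overrightarrow{BA}$ and $\overrightarrow{BC}$ are equal to

$$\overrightarrow{BA} = (5, -7, -1), \qquad \overrightarrow{BC} = (7, -1, -5).$$

Compute the cosine of the angle $\varphi$ between these vectors:

$$\cos\varphi = \frac{\overrightarrow{BA} \cdot \overrightarrow{BC}}{|\overrightarrow{BA}| \cdot |\overrightarrow{BC}|} = \frac{5 \cdot 7 + (-7) \cdot (-1) + (-1) \cdot (-5)}{\sqrt{5^2 + (-7)^2 + (-1)^2}\,\sqrt{7^2 + (-1)^2 + (-5)^2}} = \frac{47}{75}.$$

Then, compute the vector product of $\overrightarrow{BA}$ and $\overrightarrow{BC}$:

$$\overrightarrow{BA} \times \overrightarrow{BC} = \begin{vmatrix} i & j & k \\ 5 & -7 & -1 \\ 7 & -1 & -5 \end{vmatrix} = 34i + 18j + 44k.$$

Substitute the coordinate values into the area formula:

$$S = \frac{1}{2}\,|\overrightarrow{BA} \times \overrightarrow{BC}| = \frac{1}{2}\sqrt{34^2 + 18^2 + 44^2} = \sqrt{854}.$$

We finally obtain

$$\cos\varphi = \frac{47}{75}, \qquad S = \sqrt{854}.$$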
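The computation in part (a) of Problem 6.2 can be reproduced in a few lines of Python. This is an illustrative sketch rather than code from the book; the function name `triangle_area_and_cos_at_b` and the tuple representation of points are assumptions.

```python
import math


def sub(p, q):
    """Vector from point q to point p."""
    return tuple(pi - qi for pi, qi in zip(p, q))


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def norm(u):
    return math.sqrt(dot(u, u))


def triangle_area_and_cos_at_b(A, B, C):
    """Area of the triangle ABC and cosine of the angle at the vertex B."""
    ba, bc = sub(A, B), sub(C, B)
    area = 0.5 * norm(cross(ba, bc))          # S = |BA x BC| / 2
    cos_b = dot(ba, bc) / (norm(ba) * norm(bc))
    return area, cos_b


area, cos_b = triangle_area_and_cos_at_b((2, 1, 0), (3, 0, 3), (2, -3, 7))
print(area, cos_b)
```

The printed values should agree, up to rounding, with $\frac{3}{2}\sqrt{10}$ and $-14/\sqrt{286}$.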


6.3 Solution.

(a) Since the determinant composed of the vector coordinates is not equal to zero:

$$\begin{vmatrix} 1 & 1 & 1 \\ 2 & 3 & 0 \\ 3 & -1 & -1 \end{vmatrix} = -12 \ne 0,$$

the vectors a, b and c are not coplanar.

(b) The determinant composed of the vector coordinates is equal to zero:

$$\begin{vmatrix} -1 & 0 & -2 \\ 0 & 0 & -1 \\ -1 & 0 & 3 \end{vmatrix} = 0,$$

and, therefore, these vectors are coplanar.
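The coplanarity test used above reduces to evaluating a 3x3 determinant, the scalar triple product. A minimal sketch in Python follows; the function names `scalar_triple` and `are_coplanar` are hypothetical, and exact integer input is assumed so that comparison with zero is safe.

```python
def scalar_triple(a, b, c):
    """Scalar triple product (abc) as the determinant with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def are_coplanar(a, b, c):
    """Three vectors are coplanar iff their scalar triple product is zero."""
    return scalar_triple(a, b, c) == 0


print(scalar_triple((1, 1, 1), (2, 3, 0), (3, -1, -1)))   # -12
print(are_coplanar((-1, 0, -2), (0, 0, -1), (-1, 0, 3)))  # True
```

For floating-point coordinates, the exact comparison would have to be replaced with a tolerance check.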


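6.4 Solution.

Note that the points A, B, C and D lie in the same plane, since the applicate of all these points is equal to z = 1.

Let us first prove that these four points are vertices of a parallelogram. We have to prove that

$$|\overrightarrow{AB}| = |\overrightarrow{CD}|, \qquad |\overrightarrow{BC}| = |\overrightarrow{AD}|.$$

Find the coordinates of the vectors introduced for consideration:

$$\overrightarrow{AB} = (1 - 1,\ 3 - (-1),\ 1 - 1) = (0, 4, 0),$$
$$\overrightarrow{CD} = (4 - 4,\ -1 - 3,\ 1 - 1) = (0, -4, 0),$$
$$\overrightarrow{BC} = (4 - 1,\ 3 - 3,\ 1 - 1) = (3, 0, 0),$$
$$\overrightarrow{AD} = (4 - 1,\ -1 - (-1),\ 1 - 1) = (3, 0, 0).$$

Therefore,

$$|\overrightarrow{AB}| = \sqrt{0^2 + 4^2 + 0^2} = 4, \qquad |\overrightarrow{CD}| = \sqrt{0^2 + (-4)^2 + 0^2} = 4,$$
$$|\overrightarrow{BC}| = \sqrt{3^2 + 0^2 + 0^2} = 3, \qquad |\overrightarrow{AD}| = \sqrt{3^2 + 0^2 + 0^2} = 3,$$

so that $|\overrightarrow{AB}| = |\overrightarrow{CD}|$ and $|\overrightarrow{BC}| = |\overrightarrow{AD}|$.

If one of the parallelogram angles is equal to $\pi/2$, then all other angles are also equal to $\pi/2$.

Show that the scalar product of the vectors $\overrightarrow{AB}$ and $\overrightarrow{AD}$ is equal to zero:

$$\overrightarrow{AB} \cdot \overrightarrow{AD} = 0 \cdot 3 + 4 \cdot 0 + 0 \cdot 0 = 0 \ \Rightarrow\ \overrightarrow{AB} \perp \overrightarrow{AD}.$$

Therefore, ABCD is a rectangle. Find its diagonals with the help of the Pythagorean theorem:

$$AC = \sqrt{AB^2 + BC^2} = \sqrt{4^2 + 3^2} = 5.$$

It is clear that AC = BD, since the diagonals of a rectangle are equal.
As a result, we obtain AC = BD = 5.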


6.5 Solution.

Let $c = (c_x, c_y, c_z)$, where $c_x, c_y, c_z$ are the unknown coordinates of the vector.
The condition of orthogonality of a and c has the form $a \cdot c = 0$, or $2c_y - c_z = 0$.
Then, the condition of orthogonality of b and c is $b \cdot c = 0$, or $-c_x + 2c_y - 3c_z = 0$.
Since $|c| = \sqrt{7}$ and the length of a vector is equal to the square root of the sum of squares of its coordinates, $\sqrt{c_x^2 + c_y^2 + c_z^2} = \sqrt{7}$, which can be written as $c_x^2 + c_y^2 + c_z^2 = 7$.

We obtain the system of three equations in the variables $c_x, c_y, c_z$:

$$2c_y - c_z = 0,$$
$$-c_x + 2c_y - 3c_z = 0,$$
$$c_x^2 + c_y^2 + c_z^2 = 7.$$

Select $c_y$ as the independent variable and express the two other variables of the system through it: $c_z = 2c_y$ and $c_x = 2c_y - 3c_z = 2c_y - 3(2c_y) = -4c_y$. Therefore, $c = (-4c_y, c_y, 2c_y) = c_y(-4, 1, 2)$, where $c_y \in \mathbb{R}$.

The length of the vector c is equal to $|c| = \sqrt{7}$, and, therefore,

$$\mathrm{abs}(c_y) \cdot \sqrt{(-4)^2 + 1^2 + 2^2} = \sqrt{7}, \qquad \mathrm{abs}(c_y) = \frac{\sqrt{7}}{\sqrt{21}} = \frac{1}{\sqrt{3}}.$$

According to the problem statement, the vector c forms an obtuse angle with the axis Oy, i.e. $\cos\varphi < 0$, where $\varphi$ is the angle between the vectors c and j. Since

$$\cos\varphi = \frac{c \cdot j}{|c| \cdot |j|} = \frac{c_y}{\sqrt{7}} < 0,$$

we have $c_y = -\dfrac{1}{\sqrt{3}}$.

We finally obtain $c = \dfrac{1}{\sqrt{3}}(4, -1, -2)$.


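6.6 Solution.
Write the vectors a and b in coordinate form:

$$a = (3, -1, 2),$$
$$b = (1, 1, -1).$$

Then, the sum and the difference of these vectors are

$$a + b = (3 + 1,\ -1 + 1,\ 2 + (-1)) = (4, 0, 1),$$
$$a - b = (3 - 1,\ -1 - 1,\ 2 - (-1)) = (2, -2, 3).$$

Use the formula for the cosine of the angle $\varphi$ between two vectors u and v:

$$\cos\varphi = \frac{u \cdot v}{|u| \cdot |v|}.$$

After simple computations with u = a + b and v = a - b, we obtain

$$\cos\varphi = \frac{(a + b) \cdot (a - b)}{|a + b| \cdot |a - b|} = \frac{4 \cdot 2 + 0 \cdot (-2) + 1 \cdot 3}{\sqrt{17} \cdot \sqrt{17}} = \frac{11}{17}.$$

Therefore, $\varphi = \arccos\dfrac{11}{17}$.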
6.7 Solution.
Compute $|a - 2b|$:

$$|a - 2b|^2 = (a - 2b) \cdot (a - 2b) = a \cdot a - 4\,a \cdot b + 4\,b \cdot b = |a|^2 - 4|a||b|\cos\varphi + 4|b|^2,$$

where $\varphi = \pi/3$ is the angle between the vectors a and b. Having substituted the numeric values, we obtain $|a - 2b|^2 = 4 - 4 + 4 = 4$, and, therefore, $|a - 2b| = 2$.


6.8 Solution.

As is known, the necessary and sufficient condition of orthogonality of two vectors is that their scalar product is equal to zero: $a \cdot b = 0$. Substitute the coordinate values from the problem statement:

$$12 \cdot (-3) + 2 \cdot 17d - 1 \cdot d = 0,$$
$$d = \frac{36}{33} = \frac{12}{11}.$$

Therefore, the vectors a and b are orthogonal for the parameter value $d = \dfrac{12}{11}$.


6.9 Solution.
In order for the vectors to be orthogonal, their scalar product must be equal to zero: $t_1 \cdot t_2 = 0$,

$$(a - 10b) \cdot (a + xb) = 0,$$
$$a \cdot a + x\,a \cdot b - 10\,a \cdot b - 10x\,b \cdot b = 0.$$


10/3 — 10 
3/3 — 36 


Then, we find x = 


6.10 Solution.
The condition of orthogonality of the vectors: $t_1 \cdot t_2 = 0$, or

$$(2a + xb) \cdot (b - 2a) = 0,$$
$$2\,a \cdot b - 4|a|^2 + x|b|^2 - 2x\,a \cdot b = 0.$$


3 9 3 
Substitute the numeric values from the problem statement: — = 4+ + + > = 


2 
0, and, therefore, x = —. 
15 


6.11 Answer: $\sin\varphi = \dfrac{\mathrm{abs}(a_1 b_2 - a_2 b_1)}{\sqrt{a_1^2 + a_2^2}\,\sqrt{b_1^2 + b_2^2}}$.
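6.15 Answer:

$$f_n = \begin{cases} (-1)^{(n-1)/2}\, a^{n-1}\, (a \times b), & \text{if } n \text{ is odd}, \\ (-1)^{n/2}\, a^{n-2}\, \big(a^2 b - (a \cdot b)\,a\big), & \text{if } n \text{ is even}, \end{cases}$$

where $a = |a|$ in the scalar powers.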


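6.17 Answer:

$$x = \frac{\gamma}{\alpha^2}\,\alpha + \frac{1}{\alpha^2}\,\alpha \times \beta.$$

6.18 Answer:

$$x = \frac{1}{(\alpha, \beta, \gamma)}\,\big(c_1(\beta \times \gamma) + c_2(\gamma \times \alpha) + c_3(\alpha \times \beta)\big).$$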


6.19 Solution. 

Multiply the first equation of the system by the vector p scalarly on the right; 
multiply the second equation by the vector w scalarly on the right. Subtracting one 
equation from the other, we find 


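$$x = -\frac{\mu \times \sigma + \rho \times \tau}{\mu^2 + \rho^2} + \gamma_1 \mu + \gamma_2 \rho,$$

where $\gamma_1$, $\gamma_2$ are real numbers.
Having substituted the obtained expression into the first equation of the system, we find y:

$$y = \frac{\mu \times \tau - \rho \times \sigma}{\mu^2 + \rho^2} + \gamma_2 \mu - \gamma_1 \rho.$$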

Chapter 7
Equation of a Straight Line on a Plane


7.1 Slope-Intercept Form of the Equation of a Straight Line 


Consider a Cartesian coordinate system on a plane.
Let the straight line L intersect the axis Oy at the point B with the coordinates (0, b) and form the angle $\alpha$ with the axis Ox (see Fig. 7.1). For definiteness, we will assume that $\alpha < \pi/2$.

On the line, take an arbitrary point A with the coordinates (x, y). Then, from the point A, drop a perpendicular to the axis Ox, and through the point B, draw a perpendicular to the axis Oy; denote their intersection point by C. Consider the obtained triangle ABC. It is obvious that BC = x, AC = y - b, $\angle ABC = \alpha$. Since $AC = BC \tan\alpha$, we obtain $y - b = x \tan\alpha$, or $y = \tan\alpha \cdot x + b$.

Denote the tangent of the angle $\alpha$ by k. The quantity $k = \tan\alpha$ is referred to as the slope of the straight line on the plane. As a result, we arrive at the equation of a straight line of the form:

$$y = kx + b. \tag{7.1}$$


It is called the slope-intercept form of the equation of a straight line. 


Note For the lines of the form y = const, the slope is equal to zero; for the lines of 
the form x = const, the slope is undefined. 


© Springer Nature Switzerland AG 2021
S. Kurgalin, S. Borzunov, Algebra and Geometry with Python, 
https://doi.org/10.1007/978-3-030-61541-3_7 




Fig. 7.1 The line L on the plane xOy

Fig. 7.2 Construction of n, a normal vector to the line


7.2 General Equation of a Straight Line 


The general equation of a straight line on a plane has the form:

$$Ax + By + C = 0. \tag{7.2}$$

The real numbers A, B and C are called coefficients of the straight line equation.

The coefficients A and B cannot simultaneously be equal to zero: in that case, if C = 0, all points of the plane satisfy the equation, and if $C \ne 0$, no point of the plane satisfies it.

The vector $n = \begin{pmatrix} A \\ B \end{pmatrix}$ is called a normal vector of the line, or a normal. The normal vector is orthogonal to the respective line (see Fig. 7.2).

Let the inequality $B \ne 0$ be valid. In this case, the summand By can be moved to the right side, and both sides of the equation can be divided by $-B \ne 0$. As a result, we obtain

$$Ax + C = -By \tag{7.3}$$

or

$$y = -\frac{A}{B}\,x - \frac{C}{B}. \tag{7.4}$$

Let us introduce the notations $-\dfrac{A}{B} = k$ and $-\dfrac{C}{B} = b$; then, we arrive at the equation y = kx + b, which is the slope-intercept form of the equation of a straight line (see Eq. (7.1)).
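The passage from the general equation (7.2) to the slope-intercept form can be sketched in Python as follows. This is only an illustration: the function name `general_to_slope_intercept` is hypothetical, and integer coefficients with exact arithmetic via `fractions.Fraction` are assumptions made to avoid rounding.

```python
from fractions import Fraction


def general_to_slope_intercept(A, B, C):
    """Return (k, b) for the line Ax + By + C = 0 with integer coefficients.

    Uses k = -A/B and b = -C/B, which requires B != 0.
    """
    if B == 0:
        raise ValueError("the slope is undefined for a line of the form x = const")
    return Fraction(-A, B), Fraction(-C, B)


k, b = general_to_slope_intercept(2, -5, 1)   # the line 2x - 5y + 1 = 0
print(k, b)  # 2/5 1/5
```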


7.3 Slope-Intercept Form of the Equation of a Straight Line 
Through a Given Point 


Consider the slope-intercept form of the equation of a straight line:

$$y = kx + b. \tag{7.5}$$

Let this straight line pass through a point with the coordinates $(x_0, y_0)$. Substitute these coordinates into the equation:

$$y_0 = kx_0 + b. \tag{7.6}$$

Subtract Eq. (7.6) from Eq. (7.5). We obtain the sought equation of a straight line with the slope k passing through the given point:

$$y - y_0 = k(x - x_0). \tag{7.7}$$

Example 7.1 Find the slope-intercept form of the equation of the straight line with the slope k = 2 that passes through the point T(1, 5).

Solution Use the formula (7.7). Then, we have

$$y - 5 = 2(x - 1), \tag{7.8}$$

or

$$y = 2x + 3. \tag{7.9}$$
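Formula (7.7) translates directly into code: the intercept follows from $b = y_0 - kx_0$. The sketch below is illustrative only, and the function name `line_through_point` is an assumption.

```python
def line_through_point(k, x0, y0):
    """Return (k, b) of the line y = kx + b with slope k through (x0, y0)."""
    # From y - y0 = k(x - x0) it follows that b = y0 - k*x0.
    return k, y0 - k * x0


print(line_through_point(2, 1, 5))  # (2, 3), i.e. y = 2x + 3 as in Example 7.1
```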
7.4 Equation of a Straight Line Through Two Given Points


Find the equation of a straight line through two given points $T_1(x_1, y_1)$ and $T_2(x_2, y_2)$ subject to the condition that $x_1 \ne x_2$ and $y_1 \ne y_2$. For this purpose, write the equation of a straight line in the form (7.7), assuming that it passes through the point $T_1$:

$$y - y_1 = k(x - x_1). \tag{7.10}$$

Since this line also passes through the point $T_2$, we substitute its coordinates into Eq. (7.10):

$$y_2 - y_1 = k(x_2 - x_1). \tag{7.11}$$

Divide Eq. (7.10) by Eq. (7.11). We obtain

$$\frac{y - y_1}{y_2 - y_1} = \frac{x - x_1}{x_2 - x_1}. \tag{7.12}$$

This is the equation of a straight line through two given points.


Note The formula (7.12) is not applicable in the case of equality of the abscissas or equality of the ordinates of the initial points $T_1$ and $T_2$. If $x_1 = x_2$, then the equation of the line $T_1 T_2$ has the form $x = x_1$. If the condition $y_1 = y_2$ is fulfilled, then the equation of this line has the form $y = y_1$.


Example 7.2 Find the equation of the line through $T_1(a, b)$ and $T_2(b, a)$, where $a, b \in \mathbb{R}$ and $a \ne b$.


Solution Use the formula (7.12). Substitute into it the coordinates of the points $x_1 = a$, $y_1 = b$, $x_2 = b$, $y_2 = a$:

$$\frac{y - b}{a - b} = \frac{x - a}{b - a}. \tag{7.13}$$

After simple transformations, we obtain the equation of the straight line:

$$y - b = -(x - a), \tag{7.14}$$
$$x + y - a - b = 0. \tag{7.15}$$
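Cross-multiplying (7.12) gives coefficients of the general equation that remain valid for vertical and horizontal lines as well: $A = y_2 - y_1$, $B = x_1 - x_2$, $C = x_2 y_1 - x_1 y_2$. A minimal Python sketch, with the hypothetical function name `line_through_two_points`:

```python
def line_through_two_points(p1, p2):
    """Return (A, B, C) of the line Ax + By + C = 0 through p1 and p2."""
    (x1, y1), (x2, y2) = p1, p2
    if (x1, y1) == (x2, y2):
        raise ValueError("two distinct points are required")
    # (y - y1)(x2 - x1) = (x - x1)(y2 - y1), rearranged to Ax + By + C = 0.
    A = y2 - y1
    B = x1 - x2
    C = x2 * y1 - x1 * y2
    return A, B, C


# Example 7.2 with a = 2, b = 5: expect a multiple of x + y - 7 = 0.
print(line_through_two_points((2, 5), (5, 2)))  # (-3, -3, 21)
```

The result (-3, -3, 21) is the equation x + y - 7 = 0 multiplied by -3, in agreement with (7.15).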
7.5 Angle Between Two Straight Lines


Consider two straight lines specified by the equations $y = k_1 x + b_1$ and $y = k_2 x + b_2$ (see Fig. 7.3).

Fig. 7.3 Definition of the angle $\alpha$ between two straight lines

By the angle $\alpha$ between the lines, we will understand the angle by which one of these lines should be turned around their intersection point, anticlockwise, until it first coincides with the other line.

From Fig. 7.3, it is seen that the angle $\alpha$ between the lines is equal to $\alpha_1 - \alpha_2$, and the equalities $\tan\alpha_1 = k_1$ and $\tan\alpha_2 = k_2$ are fulfilled. In this case, based on the formula for the tangent of a difference of two arguments (see formula (B.14) in Appendix B), we can write

$$\tan\alpha = \tan(\alpha_1 - \alpha_2) = \frac{\tan\alpha_1 - \tan\alpha_2}{1 + \tan\alpha_1 \tan\alpha_2} = \frac{k_1 - k_2}{1 + k_1 k_2}. \tag{7.16}$$

Hence,

$$\alpha = \arctan\frac{k_1 - k_2}{1 + k_1 k_2}. \tag{7.17}$$

Having exchanged the places of the parameters $k_1$ and $k_2$, we obtain the tangent of the adjacent angle $\alpha' = \pi - \alpha$.
From the obtained formula (7.17), two consequences follow:

(a) The straight lines with the slopes $k_1$ and $k_2$ are orthogonal if the condition $1 + k_1 k_2 = 0$ is fulfilled, which is equivalent to $k_2 = -\dfrac{1}{k_1}$.
(b) The straight lines are parallel if $k_1 = k_2$.

Consider the straight lines specified by the equations in general form:

$$A_1 x + B_1 y + C_1 = 0 \quad \text{and} \quad A_2 x + B_2 y + C_2 = 0. \tag{7.18}$$

Then, $k_1 = -\dfrac{A_1}{B_1}$ and $k_2 = -\dfrac{A_2}{B_2}$. Therefore,

$$\tan\alpha = \frac{A_2 B_1 - A_1 B_2}{A_1 A_2 + B_1 B_2}. \tag{7.19}$$

From (7.19) it follows directly that the lines specified by Eqs. (7.18) are orthogonal if $A_1 A_2 + B_1 B_2 = 0$ and parallel if $A_2 B_1 - A_1 B_2 = 0$.

Example 7.3 A straight line 2x - 5y + 1 = 0 is given. Set up the equation of a straight line that passes through the point $T_0(3, 3)$:

(a) parallel to this line;
(b) perpendicular to this line.


Solution

(a) Find the slope $k_1 = -\dfrac{A}{B} = -\dfrac{2}{(-5)} = \dfrac{2}{5}$.
Write the equation of the straight line that passes through the given point with the same slope $k_1$:

$$y - y_0 = k_1(x - x_0), \tag{7.20}$$
$$y - 3 = \frac{2}{5}(x - 3). \tag{7.21}$$

Thus, the sought straight line has the form: 2x - 5y + 9 = 0.

(b) Find the slope of the line that is perpendicular to the given one: $k_2 = -\dfrac{1}{k_1} = -\dfrac{5}{2}$.
Write the equation of the straight line that passes through the given point with the slope $k_2$:

$$y - y_0 = k_2(x - x_0), \tag{7.22}$$
$$y - 3 = -\frac{5}{2}(x - 3). \tag{7.23}$$

We obtain the equation of the straight line: 5x + 2y - 21 = 0. □


7.6 Intercept Form of the Equation of a Straight Line 


Consider the equation of the line Ax + By + C = 0, where the coefficients A, B and C are not equal to zero.

Let us transform it as follows:

$$-\frac{A}{C}\,x - \frac{B}{C}\,y = 1. \tag{7.24}$$

Let us introduce the notations $a = -\dfrac{C}{A}$ and $b = -\dfrac{C}{B}$.
As a result, we obtain the equation:

$$\frac{x}{a} + \frac{y}{b} = 1, \tag{7.25}$$

which is the intercept form of the equation of a straight line.

It is obvious that the given line passes through the points with the coordinates (a, 0) and (0, b). It is shown in Fig. 7.4.

Fig. 7.4 The line that passes through the points (a, 0) and (0, b)

Thus, this line cuts off segments of length abs(a) and abs(b) on the coordinate axes.
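The intercepts in (7.25) can be computed directly from the general coefficients. The following sketch assumes integer coefficients and uses exact fractions; the function name `intercepts` is hypothetical.

```python
from fractions import Fraction


def intercepts(A, B, C):
    """Return (a, b) of the intercept form x/a + y/b = 1 for Ax + By + C = 0."""
    if A == 0 or B == 0 or C == 0:
        raise ValueError("the intercept form requires nonzero A, B and C")
    return Fraction(-C, A), Fraction(-C, B)


a, b = intercepts(2, 3, -6)   # the line 2x + 3y - 6 = 0
print(a, b)  # 3 2
```

The line 2x + 3y - 6 = 0 indeed passes through the points (3, 0) and (0, 2).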


7.7 Normal (Symmetric) Form of the Equation of a Line 


Consider an arbitrary straight line L. Let us draw, through the origin of coordinates O, a line perpendicular to L, and denote by the letter P the intersection point of these lines.

On the line OP, take the unit vector n, whose direction coincides with the direction of the vector $\overrightarrow{OP}$.

Assume that $p = |\overrightarrow{OP}|$, and $\theta$ is the angle between the vector n and the axis Ox (see Fig. 7.5).

Fig. 7.5 To the derivation of the normal form of the equation of a line

Since n is a unit vector, its coordinates are equal to the projections of this vector on the coordinate axes:

$$n = (\cos\theta, \sin\theta). \tag{7.26}$$

The arbitrary point T(x, y) lies on the considered line L if and only if the projection of the vector $\overrightarrow{OT}$ on the axis determined by the vector n is equal to p:

$$\mathrm{Pr}_n \overrightarrow{OT} = p. \tag{7.27}$$

As is well known,

$$\mathrm{Pr}_n \overrightarrow{OT} = \frac{\overrightarrow{OT} \cdot n}{|n|} = \overrightarrow{OT} \cdot n. \tag{7.28}$$

Bearing in mind that $\overrightarrow{OT} = (x, y)$, and the vector n is determined by the equality (7.26), we obtain the following expression for their scalar product:

$$\overrightarrow{OT} \cdot n = x\cos\theta + y\sin\theta. \tag{7.29}$$

From the above reasoning, it follows that the point T(x, y) lies on the line L if and only if the coordinates of this point satisfy the relation:

$$x\cos\theta + y\sin\theta - p = 0. \tag{7.30}$$

This equation is called the normal form of the equation of the line L or the Hesse¹ normal form.
Let the straight line L be specified by the general equation:

$$Ax + By + C = 0, \tag{7.31}$$

where $C \ne 0$.


¹ Ludwig Otto Hesse (1811-1874), German mathematician.
In order to transform the general equation of a line into the normal form, multiply both sides of the equation by the so-called normalizing factor:

$$\mu = -\frac{1}{\sqrt{A^2 + B^2}}\,\mathrm{sgn}(C), \tag{7.32}$$

where sgn(C) is the sign of the coefficient C determined by the rule:

$$\mathrm{sgn}(C) = \begin{cases} +1, & \text{if } C > 0, \\ 0, & \text{if } C = 0, \\ -1, & \text{if } C < 0. \end{cases} \tag{7.33}$$

As a result, the new coefficients at x and y (namely, $\mu A$ and $\mu B$) will satisfy the condition:

$$(\mu A)^2 + (\mu B)^2 = 1. \tag{7.34}$$

If we select the angle $\theta$ so that $\cos\theta = \mu A$ and $\sin\theta = \mu B$, while $p = -\mu C = \mathrm{abs}(\mu C)$, then we will obtain the normal form of the equation of a line.


Note If the condition C = 0 is fulfilled, then the line passes through the origin of coordinates, and the normalizing factor can be taken with an arbitrary sign: $\mu = \pm\dfrac{1}{\sqrt{A^2 + B^2}}$.

Let us introduce the concept of deviation of an arbitrary point T(x, y) from the given line L. Let the number d denote the distance from the point T to this line.

We will call the deviation $\delta$ of the point T from the line L the number +d if the point T and the origin of coordinates O lie on opposite sides of the line L, and the number -d if the points T and O lie on the same side of L.

Let us show that the left side of the normal form of the equation of the line is equal to the deviation of the point T(x, y) from this line.
Let Q be the projection of the point T on the axis determined by the vector n. The deviation $\delta$ of the point T from the line L is equal to PQ.
From Fig. 7.6, it is seen that

$$\delta = PQ = OQ - OP = OQ - p. \tag{7.35}$$

But $OQ = \mathrm{Pr}_n \overrightarrow{OT} = x\cos\theta + y\sin\theta$. So,

$$OQ = x\cos\theta + y\sin\theta. \tag{7.36}$$

Fig. 7.6 The deviation $\delta = PQ$ of the point T from the line L

Combining the obtained formulae (7.35) and (7.36), we obtain

$$\delta = x\cos\theta + y\sin\theta - p. \tag{7.37}$$

This brings us to the following rule: in order to find the deviation $\delta$ of the point $T(x_0, y_0)$ from the line L, we should substitute the coordinates $x_0$ and $y_0$ of the point T instead of x and y into the left side of the normal form of the equation of the line. The distance from the point T to the line L is equal to the absolute value of the deviation.


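Example 7.4 Let us compute the distance from the point T(5, 4) to the line through A(1, -2) and B(0, 3).


Solution Write the equation of a straight line through the points A and B:

$$\frac{x - 1}{0 - 1} = \frac{y + 2}{3 + 2}, \quad \text{or} \quad 5x + y - 3 = 0. \tag{7.38}$$

Having multiplied the resulting equality by $\mu = \dfrac{1}{\sqrt{26}}$, we bring the equation to the normal form:

$$\frac{5}{\sqrt{26}}\,x + \frac{1}{\sqrt{26}}\,y - \frac{3}{\sqrt{26}} = 0. \tag{7.39}$$

Then, the distance d from the point T to the line is equal to

$$d = \mathrm{abs}\left(\frac{25}{\sqrt{26}} + \frac{4}{\sqrt{26}} - \frac{3}{\sqrt{26}}\right) = \frac{26}{\sqrt{26}} = \sqrt{26}. \tag{7.40}$$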
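The normalization rule and the deviation formula (7.37) can be sketched in Python as follows. The function names `normal_form` and `deviation` are hypothetical; for C = 0 the sketch arbitrarily picks the positive sign of the normalizing factor, which the note above permits.

```python
import math


def normal_form(A, B, C):
    """Return (cos_theta, sin_theta, p) of the normal form of Ax + By + C = 0."""
    length = math.hypot(A, B)
    if length == 0:
        raise ValueError("A and B must not both be zero")
    # The sign of mu is opposite to the sign of C, so that p = -mu*C >= 0.
    mu = (-1.0 if C > 0 else 1.0) / length
    return mu * A, mu * B, -mu * C


def deviation(A, B, C, x0, y0):
    """Deviation of the point (x0, y0) from the line, formula (7.37)."""
    cos_t, sin_t, p = normal_form(A, B, C)
    return x0 * cos_t + y0 * sin_t - p


# Example 7.4: the line 5x + y - 3 = 0 and the point T(5, 4).
d = abs(deviation(5, 1, -3, 5, 4))
print(d, math.sqrt(26))
```

The deviation of the origin itself is -p, which is negative or zero, in agreement with the sign convention adopted for points lying on the same side of the line as O.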
7.8 Line Segments


The line segment $M_1 M_2$ is the part of a straight line between its two points $M_1(x_1, y_1)$ and $M_2(x_2, y_2)$. The points $M_1(x_1, y_1)$ and $M_2(x_2, y_2)$ are the endpoints of the segment [8].

The set of points belonging to the segment is specified as follows:

$$M_1 M_2 = \{(x, y) : x = (1 - t)x_1 + t x_2,\ y = (1 - t)y_1 + t y_2,\ t \in [0, 1]\}. \tag{7.41}$$

There exists an equivalent notation of $M_1 M_2$:

$$M_1 M_2 = \{(x, y) : x = x_1 + (x_2 - x_1)t,\ y = y_1 + (y_2 - y_1)t,\ t \in [0, 1]\}. \tag{7.42}$$

The variable $0 \le t \le 1$ in the formulas (7.41) and (7.42) is referred to as the parameter of the segment.
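The two parametrizations (7.41) and (7.42) can be compared in a short Python sketch. The function names are hypothetical, and points are assumed to be plain (x, y) tuples.

```python
def point_on_segment(m1, m2, t):
    """Point of the segment M1M2 for the parameter t, formula (7.41)."""
    if not 0 <= t <= 1:
        raise ValueError("the parameter t must lie in [0, 1]")
    return ((1 - t) * m1[0] + t * m2[0],
            (1 - t) * m1[1] + t * m2[1])


def point_on_segment_alt(m1, m2, t):
    """The same point computed by the equivalent formula (7.42)."""
    return (m1[0] + (m2[0] - m1[0]) * t,
            m1[1] + (m2[1] - m1[1]) * t)


m1, m2 = (-1, -2), (4, 1)
print(point_on_segment(m1, m2, 0))        # the endpoint M1
print(point_on_segment(m1, m2, 1))        # the endpoint M2
print(point_on_segment(m1, m2, 0.5))      # the midpoint (1.5, -0.5)
print(point_on_segment_alt(m1, m2, 0.5))  # the same midpoint
```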


Example 7.5 Let us write a program in Python that determines, from the coordinates of the segment endpoints, in which coordinate quadrants the segment is located. For example, the segment $L_1 L_2$ that connects the points $L_1(-1, -2)$ and $L_2(4, 1)$ lies in the I, III and IV quadrants. Another example: the segment that connects the points $M_1(-1, -2)$ and $M_2(-1, -2)$ entirely belongs to the II quadrant (see Fig. 7.7, which shows the segments $L_1 L_2$ and $M_1 M_2$ with the numbers of each quadrant).


Solution In order to represent a point of the Cartesian plane in the computer memory, introduce a class Point, which contains two fields: x and y, the abscissa and the ordinate of the point. Thus, the segment is determined by its boundary points; denote them by $P_1$ and $P_2$.

Fig. 7.7 To the Example 7.5. The segment $L_1 L_2$ lies in the I, III and IV quadrants, while the segment $M_1 M_2$ entirely belongs to the II quadrant

The main computing work is performed by the function get_quadrants(p1, p2). It returns the list that contains the numbers of the quadrants where the segment $P_1 P_2$ is located.

The auxiliary function get_quadrant(p) is used to determine the number of the quadrant to which its only argument, the point p, belongs. This function returns an integer from the set {0, 1, 2, 3, 4}. The value get_quadrant(p) is equal to zero if and only if p lies on the Ox or Oy axis and therefore does not belong to any of the plane quadrants.

Execution of the function get_quadrants(p1, p2) begins with checking whether the points p1 and p2 lie in adjacent quadrants, i.e. those that form the unordered pairs 1-2, 2-3, 3-4, 4-1. During this check, the variables p1_quad and p2_quad are assigned the numbers of the quadrants to which the points p1 and p2 belong, respectively.

Since the numbers of adjacent quadrants differ by one modulo two, the True value of the boolean variable

is_adjacent = abs(p1_quad - p2_quad) % 2 == 1

is a sufficient condition of adjacency.

Then, the following operations are executed. If the points p1 and p2 lie in adjacent quadrants, then the values p1_quad and p2_quad are written into the final list, following which the function get_quadrants() terminates.

Otherwise, the equality of the numbers p1_quad and p2_quad is checked. If it is valid, then the entire segment lies in the quadrant number p1_quad, and the function get_quadrants() terminates.

The last case of the opposite quadrants remains, i.e. of the pairs 1-3 or 2-4. The line drawn through the points p1 and p2 intersects the ordinate axis at the point with the coordinates (0, b), where

b = (p1.y*p2.x - p1.x*p2.y)/(p2.x - p1.x). (7.43)

If p1_quad = 1 or p1_quad = 3, then at b > 0 it is necessary to additionally write the second quadrant to the final list, and the fourth quadrant at b < 0. Otherwise (in the case p1_quad ∈ {2, 4}) at b > 0, it is necessary to additionally write the first quadrant to the list, and the third quadrant at b < 0. At b = 0 the segment passes through the origin of coordinates, and no quadrant is added.

Thus, all the possible cases of location of the segment $P_1 P_2$ relative to the coordinate axes are exhausted, on the condition that none of the segment endpoints lies on a coordinate axis.

Listing 7.1 provides the text of the program that determines, from the segment endpoint coordinates, in which quadrants the segment is located.
Listing 7.1


class Point:
    def __init__(self, x, y):
        self.x = x
        self.y = y


def get_quadrant(p):
    # Returns 0 for a point lying on a coordinate axis
    if p.x > 0 and p.y > 0:
        return 1
    elif p.x < 0 and p.y > 0:
        return 2
    elif p.x < 0 and p.y < 0:
        return 3
    elif p.x > 0 and p.y < 0:
        return 4
    else:
        return 0


def get_quadrants(p1, p2):
    p1_quad = get_quadrant(p1)
    p2_quad = get_quadrant(p2)

    is_adjacent = abs(p1_quad - p2_quad) % 2 == 1

    if is_adjacent:
        return [p1_quad, p2_quad]
    elif p1_quad == p2_quad:
        return [p1_quad]
    else:
        # Opposite quadrants: ordinate of the intersection with Oy, see (7.43)
        b = (p1.y * p2.x - p1.x * p2.y) \
            / (p2.x - p1.x)

        if b == 0:
            return [p1_quad, p2_quad]
        if p1_quad == 1 or p2_quad == 1:
            quadrant = 2 if b > 0 else 4
            return [p1_quad, p2_quad, quadrant]

        quadrant = 1 if b > 0 else 3
        return [p1_quad, p2_quad, quadrant]
The most general case, when $P_1$ or $P_2$ can belong to the coordinate axes, is discussed in Problem 7.30. The solution of this problem includes the function get_quadrants_general(), which is free from the mentioned constraint.

□
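One way to cross-check the result of Example 7.5 is a brute-force scan along the parametrization (7.42). The sketch below is not the method of Listing 7.1; it is an independent, approximate check, and the names `quadrant_of` and `quadrants_by_sampling` as well as the number of samples are assumptions.

```python
def quadrant_of(x, y):
    """Quadrant number of the point (x, y), or 0 if it lies on an axis."""
    if x > 0 and y > 0:
        return 1
    if x < 0 and y > 0:
        return 2
    if x < 0 and y < 0:
        return 3
    if x > 0 and y < 0:
        return 4
    return 0


def quadrants_by_sampling(p1, p2, steps=1000):
    """Collect the quadrants visited by sample points of the segment p1p2."""
    found = set()
    for i in range(steps + 1):
        t = i / steps                          # parameter of the segment
        x = p1[0] + (p2[0] - p1[0]) * t
        y = p1[1] + (p2[1] - p1[1]) * t
        q = quadrant_of(x, y)
        if q:
            found.add(q)
    return sorted(found)


print(quadrants_by_sampling((-1, -2), (4, 1)))  # [1, 3, 4]
```

Sampling may miss a quadrant that the segment only clips over a very short stretch, so it serves as a sanity check rather than a replacement for the exact case analysis.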


Review Questions 


1. How is the slope of a straight line on a plane determined? 

2. Write the equation of a straight line with a slope. 

3. What is the form of the general equation of a straight line on a plane? 

4. What is the normal to the line? 

5. What does the equation with a slope for a straight line through a specified point 
look like? 

6. Write the equation of a straight line through two specified points. 

7. How is the angle between two lines computed? 

8. Write intercept form of the equation of a straight line. 

9. Define the deviation of an arbitrary point from a given line. 

10. For the solution of what problem is it convenient to use the normal form of the equation of a line?

11. How can the set of points of the segment $M_1 M_2$ be specified with the help of a parameter?


Problems 


7.1. Find the intersection point of the lines 2x - 3y + 4 = 0 and 4x + y - 6 = 0.

7.2. The sides of the triangle lie on the lines 5x - y + 12 = 0, x + y + 3 = 0 and 4x + 3y - 6 = 0. Find the coordinates of the vertices of this triangle.

7.3. The coordinates of the vertices of a triangle are (5, -4), (6, -6) and (-15, 4). Find the equations of its sides.

7.4. Show that the area of a triangle with the vertices $(x_1, y_1)$, $(x_2, y_2)$ and $(x_3, y_3)$ is equal to

$$S = \frac{1}{2}\,\mathrm{abs}\begin{vmatrix} x_1 & y_1 & 1 \\ x_2 & y_2 & 1 \\ x_3 & y_3 & 1 \end{vmatrix}. \tag{7.44}$$

7.5. The sides of the triangle lie on the lines x + y + 1 = 0, x + 2y - 3 = 0 and 4x - 3y - 2 = 0. Compute the area of this triangle.
7.6. The median of a triangle is the segment that connects its vertex with the midpoint of the opposite side. Set up the equations of the lines on which the medians of the triangle ABC lie, if A(1, 2), B(4, -3), C(6, 6).

7.7. Compute the distance from the point T(1, 7) to the line through A(-3, -20) and B(4, 17).

7.8. Compute the distance from the origin of coordinates to the line given by the equation $\dfrac{x}{x_0} + \dfrac{y}{y_0} = 1$, where $x_0$, $y_0$ are real numbers not equal to zero.

7.9. One of the sides of the square lies on the line x + 3y + 10 = 0. Find the area of this square if the coordinates of one of its vertices are (-4, -4).

7.10. At what point do the lines specified by the equations x/a + y/b = 1 and x/b + y/a = 1, where $a, b \ne 0$, intersect?

7.11. Find the angle between the lines 3x + 5y - 10 = 0 and -2x + y + 4 = 0.

7.12. Find the values of the parameters $\lambda$ and $\mu$ at which the lines $\lambda x + 6y - 2 = 0$ and $2x + 3y - \mu = 0$:

(1) have exactly one common point,
(2) coincide,
(3) are parallel.

7.13. Compute the distance between the parallel lines specified by the equations Ax + By + C = 0 and Ax + By + C' = 0, where $C \ne C'$.

*7.14. On what condition do the lines $A_1 x + B_1 y + C_1 = 0$, $A_2 x + B_2 y + C_2 = 0$, ..., $A_n x + B_n y + C_n = 0$ intersect at one point?

7.15. The line L passes through the point $T_0(x_0, y_0)$ at the angle $\alpha$ to the abscissa axis. Write the equation of the line L* that passes through the same point $T_0$ at the angle $\Delta\alpha$ to the line L.

*7.16. The sides of a triangle are specified by the equations $A_i x + B_i y + C_i = 0$, where i = 1, 2, 3. Find the equation of

(a) the median,
(b) the altitude,
(c) the bisector,

drawn to the third side.

7.17. Compute the area of the triangle intercepted by the line Ax + By + C = 0 from the quadrantal angle.

7.18. Find the equation of the line that passes through the point $T(x_0, y_0)$ and intercepts from the quadrantal angle a triangle with an area equal to S. The variables $x_0$ and $y_0$ are positive.

7.19. Assume that some line passing through the point $T(x_0, y_0)$ intercepts a right triangle from the quadrantal angle. What is the least area of this triangle? The variables $x_0$ and $y_0$ are positive.
*7.20. The sides of a triangle are specified by the equations $\alpha_i x + \beta_i y + \gamma_i = 0$, where i = 1, 2, 3. Show that the area of this triangle can be calculated by the formula:

$$S = \frac{1}{2}\,\frac{\Delta^2}{\mathrm{abs}(\Delta_1 \Delta_2 \Delta_3)}, \tag{7.45}$$

where $\Delta = \begin{vmatrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \\ \alpha_3 & \beta_3 & \gamma_3 \end{vmatrix}$, and $\Delta_i$ are the cofactors of the elements $\gamma_i$, $i \in \{1, 2, 3\}$.

7.21. Prove that the points $T_1(-2, -8)$, $T_2(18, 2)$ and $T_3(3, -11/2)$ lie on the same line.

7.22. For what values of the real parameter a do the points $T_1(0, 1)$, $T_2(a, 2)$ and $T_3(3, a)$ lie on the same line?

7.23. The coordinates of the vertices of a triangle are given: A(1, -1), B(2, 4), C(-8, -1). Set up the equation of the line that passes through the vertex A parallel to the side BC.

7.24. The coordinates of the vertices of a triangle are given: A(-2, 0), B(2, 3), C(-1, -1). Set up the equation of the line that passes through the vertex B parallel to the side AC.

7.25. It is known about the point N that it lies on the ordinate axis, and the distance from this point to N'(-2, -5/2) is equal to $d = 2\sqrt{2}$. Find the coordinates of the point N.

7.26. It is known that the area of the triangle is equal to S = 6 and its two vertices have the coordinates (1, 1) and (-2, -3). Find the coordinates of the third vertex of the triangle if this vertex lies on the abscissa axis.

7.27. It is known that the area of the triangle is equal to S = 10 and its two vertices have the coordinates (-2, 3) and (-7, -1). Find the coordinates of the third vertex of the triangle if this vertex lies on the ordinate axis.

7.28. Find the projection of the point (2, -13) on the line that passes through the points (0, 2) and (2, -8).

7.29. Find the projection of the point (a, a) on the line that passes through the points (1, 2a) and (2, 3a), if a is an arbitrary real number.

7.30. Write a program in Python that determines, from the segment endpoint coordinates, in which coordinate quadrants the segment is located. In contrast to the solution of Example 7.5, consider the full set of possible cases, including the one when the segment endpoints can belong to the coordinate axes.
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Answers and Solutions 


7.1 Answer: (1, 2). 
7.2 Answer: (—5/2, —1/2), (—30/19, 78/19), (15, —18). 
7.3 Solution. 
Let us use the equation of a straight line through two given points (7.12): 


YTV X71 


y2— Yi X2 — X1 


Substituting the coordinates of the points from the problem statement, we obtain the 
equation of the triangle sides: 


x+y—-6=0, 2x+5y+10=0, 10x+21y+66=0. 


7.4 Solution.

Denote the triangle vertices $(x_1, y_1)$, $(x_2, y_2)$ and $(x_3, y_3)$ by $A_1$, $A_2$ and $A_3$, respectively.

As is well known (see page 262), the area of an arbitrary triangle can be represented as half of the modulus of the vector product $\overrightarrow{A_1A_2} \times \overrightarrow{A_1A_3}$:

$$S = \frac{1}{2}\left|\overrightarrow{A_1A_2} \times \overrightarrow{A_1A_3}\right| = \frac{1}{2}\operatorname{abs}\begin{vmatrix} x_2 - x_1 & y_2 - y_1 \\ x_3 - x_1 & y_3 - y_1 \end{vmatrix}.$$

This expression can be rewritten in the equivalent form:

$$S = \frac{1}{2}\operatorname{abs}\begin{vmatrix} x_1 & y_1 & 1 \\ x_2 & y_2 & 1 \\ x_3 & y_3 & 1 \end{vmatrix}.$$

Thus, the formula (7.44) is proved. It implies that the necessary and sufficient condition for three points to lie on one line is that the respective third-order determinant is equal to zero.
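
The determinant formula and the collinearity criterion can be tried out with a short sketch. The function names `triangle_area` and `are_collinear` are illustrative assumptions, not part of the textbook; exact rational arithmetic is used so that the zero test is reliable.

```python
from fractions import Fraction


def triangle_area(p1, p2, p3):
    """Half the absolute value of the determinant |x_i y_i 1| of three points."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    # The 3x3 determinant reduces to this 2x2 determinant of side vectors.
    det = (x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)
    return abs(Fraction(det)) / 2


def are_collinear(p1, p2, p3):
    """Three points lie on one line exactly when the determinant vanishes."""
    return triangle_area(p1, p2, p3) == 0


print(triangle_area((0, 0), (4, 0), (0, 3)))
print(are_collinear((-2, -8), (18, 2), (3, Fraction(-11, 2))))
```

The second call uses the three points of Problem 7.21, so it doubles as a check of that statement.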


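7.5 Solution.
Solving the systems of equations:

$$\begin{cases} x + y + 1 = 0, \\ x + 2y - 3 = 0; \end{cases} \qquad \begin{cases} x + y + 1 = 0, \\ 4x - 3y - 2 = 0; \end{cases} \qquad \begin{cases} x + 2y - 3 = 0, \\ 4x - 3y - 2 = 0, \end{cases}$$

we find the coordinates of the vertices of the triangle: $(-5, 4)$, $(-1/7, -6/7)$, $(13/11, 10/11)$.

As is shown in Problem 7.4, the area of an arbitrary triangle $A_1A_2A_3$, whose vertices have the coordinates $(x_1, y_1)$, $(x_2, y_2)$ and $(x_3, y_3)$, respectively, is equal to

$$S = \frac{1}{2}\operatorname{abs}\begin{vmatrix} x_1 & y_1 & 1 \\ x_2 & y_2 & 1 \\ x_3 & y_3 & 1 \end{vmatrix}.$$

In our case,

$$S = \frac{1}{2}\operatorname{abs}\begin{vmatrix} -5 & 4 & 1 \\ -1/7 & -6/7 & 1 \\ 13/11 & 10/11 & 1 \end{vmatrix} = \frac{1}{2}\operatorname{abs}\left(\frac{1156}{77}\right) = \frac{578}{77}.$$

Note. See Problem 7.20 for a general solution.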
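
As a cross-check, the same area can be obtained directly from the coefficients of the three lines by the formula of Problem 7.20, without computing the vertices. The following is a minimal sketch; the function name `area_from_lines` and the representation of a line as a tuple `(alpha, beta, gamma)` are assumptions made for illustration.

```python
from fractions import Fraction


def area_from_lines(l1, l2, l3):
    """Area of the triangle bounded by three lines alpha*x + beta*y + gamma = 0."""
    (a1, b1, c1), (a2, b2, c2), (a3, b3, c3) = l1, l2, l3
    # Cofactors of the third-column elements gamma_1, gamma_2, gamma_3.
    d1 = a2 * b3 - a3 * b2
    d2 = a3 * b1 - a1 * b3
    d3 = a1 * b2 - a2 * b1
    if d1 * d2 * d3 == 0:
        raise ValueError("two of the lines are parallel: no triangle")
    # Third-order determinant expanded along the third column.
    delta = c1 * d1 + c2 * d2 + c3 * d3
    return Fraction(delta * delta, 2 * abs(d1 * d2 * d3))


# The three lines of Problem 7.5.
print(area_from_lines((1, 1, 1), (1, 2, -3), (4, -3, -2)))  # 578/77
```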


7.6 Solution.

The abscissa and the ordinate of the midpoint of the segment with the endpoints $(x_1, y_1)$ and $(x_2, y_2)$ are determined by the formulae $x_m = (x_1 + x_2)/2$ and $y_m = (y_1 + y_2)/2$, respectively.

Find the midpoints of the sides: $(5/2, -1/2)$, $(7/2, 4)$, $(5, 3/2)$.

Then, we apply the formula (7.12) and obtain the following equations of the medians of the triangle $ABC$:

$$x + 8y - 17 = 0, \quad 14x + y - 53 = 0, \quad 13x - 7y - 36 = 0.$$


7.7 Solution.

The general equation of a straight line through $(-3, -20)$ and $(4, 17)$ has the form $37x - 7y - 29 = 0$.

We obtain the normal equation of this line. The normalizing multiplier (7.32) is equal to

$$\mu = -\frac{1}{\sqrt{37^2 + 7^2}}\operatorname{sgn}(-29) = \frac{1}{\sqrt{1418}}.$$

Thus, the equation in normal form is

$$\frac{37}{\sqrt{1418}}\,x - \frac{7}{\sqrt{1418}}\,y - \frac{29}{\sqrt{1418}} = 0,$$

and the deviation of the point $T(1, 7)$ from the line is equal to

$$\delta = \frac{37 \cdot 1 - 7 \cdot 7 - 29}{\sqrt{1418}} = -\frac{41}{\sqrt{1418}}.$$

Therefore, the sought distance is equal to

$$d = \operatorname{abs}(\delta) = \frac{41}{\sqrt{1418}}.$$
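
The computation can be reproduced numerically. The sketch below assumes the sign convention used in this chapter, namely that the normalizing multiplier has the sign opposite to that of the free term $C$; the function name `deviation` is an illustrative choice.

```python
import math


def deviation(a, b, c, x0, y0):
    """Signed deviation of the point (x0, y0) from the line a*x + b*y + c = 0."""
    # Normalizing multiplier: sign opposite to c (for c == 0 the sign is arbitrary).
    sign = -1.0 if c > 0 else 1.0
    mu = sign / math.hypot(a, b)
    return mu * (a * x0 + b * y0 + c)


delta = deviation(37, -7, -29, 1, 7)
print(delta)        # negative: T and the origin lie on the same side of the line
print(abs(delta))   # the distance, equal to 41 / sqrt(1418)
```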


7.8 Solution.
Bring the equation of the line, written in the general form $y_0 x + x_0 y - x_0 y_0 = 0$, to normal form.
The normalizing multiplier is equal to

$$\mu = -\frac{1}{\sqrt{x_0^2 + y_0^2}}\operatorname{sgn}(-x_0 y_0) = \frac{1}{\sqrt{x_0^2 + y_0^2}}\operatorname{sgn}(x_0 y_0),$$

and, therefore, the equation of the line can be written in the form:

$$\left(\frac{y_0}{\sqrt{x_0^2 + y_0^2}}\,x + \frac{x_0}{\sqrt{x_0^2 + y_0^2}}\,y - \frac{x_0 y_0}{\sqrt{x_0^2 + y_0^2}}\right)\operatorname{sgn}(x_0 y_0) = 0.$$

Compute the deviation of the origin of coordinates $(0, 0)$:

$$\delta = -\frac{x_0 y_0}{\sqrt{x_0^2 + y_0^2}}\operatorname{sgn}(x_0 y_0).$$

The distance from the origin of coordinates to the line is equal to the absolute value of the deviation:

$$d = \frac{\operatorname{abs}(x_0 y_0)}{\sqrt{x_0^2 + y_0^2}}.$$

7.9 Solution.
The deviation of the point $(-4, -4)$ from the line $-\frac{1}{\sqrt{10}}\,x - \frac{3}{\sqrt{10}}\,y - \sqrt{10} = 0$ is equal to $\delta = \frac{6}{\sqrt{10}} = 3\sqrt{\frac{2}{5}}$.

Hence, the distance from the point to this line is equal to $d = 3\sqrt{\frac{2}{5}}$; it coincides with the length of the side of the square.
The area of the square is $S = d^2 = 18/5$.

7.10 Solution.
The system of equations:

$$\begin{cases} x/a + y/b = 1, \\ x/b + y/a = 1, \end{cases}$$

for $a, b \ne 0$ (and $a \ne \pm b$) has the unique solution $x = y = ab/(a + b)$.

Therefore, the lines intersect at the point $\left(\frac{ab}{a + b}, \frac{ab}{a + b}\right)$.

7.11 Solution.
Take the formula (7.19) in order to find the tangent of the angle between the lines specified in the general form. Substitute into it the values $A_1 = 3$, $B_1 = 5$, $A_2 = -2$ and $B_2 = 1$, and we obtain

$$\tan\alpha = \frac{A_2 B_1 - A_1 B_2}{A_1 A_2 + B_1 B_2} = 13.$$

Therefore, the angle between the lines is $\alpha = \arctan 13$.


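7.12 Solution.
Consider the system of equations:

$$\begin{cases} \lambda x + 6y = 2, \\ 2x + 3y = \mu. \end{cases}$$

Using the bordering minor method (see page 64), we find the ranks of the system matrix and the augmented matrix:

$$\operatorname{rk}\begin{pmatrix} \lambda & 6 \\ 2 & 3 \end{pmatrix} = \begin{cases} 1, & \text{if } \lambda = 4, \\ 2, & \text{if } \lambda \ne 4; \end{cases}$$

and, for $\lambda = 4$,

$$\operatorname{rk}\begin{pmatrix} 4 & 6 & 2 \\ 2 & 3 & \mu \end{pmatrix} = \operatorname{rk}\begin{pmatrix} 4 & 6 & 2 \\ 0 & 0 & \mu - 1 \end{pmatrix} = \begin{cases} 1, & \text{if } \mu = 1, \\ 2, & \text{if } \mu \ne 1. \end{cases}$$

Therefore:

(a) at $\lambda \ne 4$ and any $\mu$, the system has a unique solution, and the lines intersect at exactly one point;

(b) at $\lambda = 4$, $\mu = 1$, the system has an infinite set of solutions, and the lines coincide;

(c) at $\lambda = 4$, $\mu \ne 1$, the system has no solution, and the lines are parallel.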
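
The rank comparison for two lines reduces to checking three second-order minors, which can be sketched as follows. The function name `classify_lines` and the tuple representation `(A, B, C)` of the line $Ax + By + C = 0$ are assumptions made for this illustration.

```python
def classify_lines(l1, l2):
    """Mutual position of the lines A*x + B*y + C = 0 given as tuples (A, B, C)."""
    (a1, b1, c1), (a2, b2, c2) = l1, l2
    # Rank of the system matrix is 2: a unique common point.
    if a1 * b2 - a2 * b1 != 0:
        return "intersect"
    # Rank of the system matrix is 1; the augmented matrix also has rank 1
    # only if the remaining second-order minors vanish.
    if a1 * c2 - a2 * c1 == 0 and b1 * c2 - b2 * c1 == 0:
        return "coincide"
    return "parallel"


# Lines of Problem 7.12 for several values of lambda and mu.
for lam, mu in [(1, 0), (4, 1), (4, 3)]:
    print(lam, mu, classify_lines((lam, 6, -2), (2, 3, -mu)))
```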
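7.13 Solution.
Find the deviation of the origin of coordinates from each line:

$$\delta_1 = -\frac{1}{\sqrt{A^2 + B^2}}\,C\operatorname{sgn}(C), \qquad \delta_2 = -\frac{1}{\sqrt{A^2 + B^2}}\,C'\operatorname{sgn}(C').$$

If $C$ and $C'$ have the same sign, i.e. both lines lie on the same side of the origin, the distance $d$ between the parallel lines is equal to the absolute value of the difference of the deviations:

$$d = \operatorname{abs}(\delta_1 - \delta_2) = \operatorname{abs}\left(-\frac{1}{\sqrt{A^2 + B^2}}\,C\operatorname{sgn}(C) + \frac{1}{\sqrt{A^2 + B^2}}\,C'\operatorname{sgn}(C')\right) = \frac{\operatorname{abs}(C' - C)}{\sqrt{A^2 + B^2}}.$$

The same result holds for arbitrary $C \ne C'$: any point $(x_1, y_1)$ of the first line satisfies $Ax_1 + By_1 = -C$, so its distance to the second line is $\operatorname{abs}(Ax_1 + By_1 + C')/\sqrt{A^2 + B^2} = \operatorname{abs}(C' - C)/\sqrt{A^2 + B^2}$.

7.14 Answer: the criterion of intersection of $n$ lines at one point is the equality of the ranks of the two matrices:

$$\operatorname{rk}\begin{pmatrix} A_1 & B_1 \\ A_2 & B_2 \\ \vdots & \vdots \\ A_n & B_n \end{pmatrix} = \operatorname{rk}\begin{pmatrix} A_1 & B_1 & C_1 \\ A_2 & B_2 & C_2 \\ \vdots & \vdots & \vdots \\ A_n & B_n & C_n \end{pmatrix},$$

where the common rank must be equal to 2 for the intersection point to be unique.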

7.15 Solution.

The equation of the line $L^*$ that passes through the point $T_0(x_0, y_0)$ has the form $y - y_0 = k(x - x_0)$. In this equation, $k$ is the unknown slope.

In order to find the value of $k$, let us use its property: $k = \tan\alpha^*$, where $\alpha^*$ is the angle of inclination of $L^*$ relative to the abscissa axis.

Since $L^*$ passes at the angle $\Delta\alpha$ to the line $L$, two options arise: $\alpha^* = \alpha - \Delta\alpha$ and $\alpha^* = \alpha + \Delta\alpha$. Write these equations in the form of a common equality:

$$\alpha^* = \alpha \pm \Delta\alpha.$$

By the formulae for the tangent of the sum and difference of two angles (B.13) and (B.14) on page 412, we have

$$k = \tan(\alpha \pm \Delta\alpha) = \frac{\tan\alpha \pm \tan\Delta\alpha}{1 \mp \tan\Delta\alpha\,\tan\alpha}.$$

We finally obtain the equation of the line $L^*$:

$$y - y_0 = \frac{\tan\alpha \pm \tan\Delta\alpha}{1 \mp \tan\alpha\,\tan\Delta\alpha}\,(x - x_0).$$

7.16 Answer:

(a) $(A_1 x + B_1 y + C_1)\begin{vmatrix} A_2 & B_2 \\ A_3 & B_3 \end{vmatrix} = (A_2 x + B_2 y + C_2)\begin{vmatrix} A_3 & B_3 \\ A_1 & B_1 \end{vmatrix}$;

(b) $(A_1 x + B_1 y + C_1)(A_2 A_3 + B_2 B_3) = (A_2 x + B_2 y + C_2)(A_1 A_3 + B_1 B_3)$;

(c) $\dfrac{A_1 x + B_1 y + C_1}{\sqrt{A_1^2 + B_1^2}} = -s\,\dfrac{A_2 x + B_2 y + C_2}{\sqrt{A_2^2 + B_2^2}}$,

where $s = \operatorname{sgn}\left(\begin{vmatrix} A_1 & B_1 \\ A_3 & B_3 \end{vmatrix}\begin{vmatrix} A_2 & B_2 \\ A_3 & B_3 \end{vmatrix}\right)$.


7.17 Solution.
Denote the intersection points of the line $Ax + By + C = 0$ with the coordinate axes by $P(x_0, 0)$ and $Q(0, y_0)$, where $x_0 = -\frac{C}{A}$, $y_0 = -\frac{C}{B}$. The coordinates $x_0$ and $y_0$ are equal in absolute value to the lengths of the legs of the right triangle $POQ$ lying on the axes $Ox$ and $Oy$, respectively. The area of this triangle is equal to half of the product of the legs:

$$S = \frac{1}{2}\operatorname{abs}(x_0 y_0) = \frac{1}{2}\operatorname{abs}\left(\frac{C}{A} \cdot \frac{C}{B}\right) = \frac{C^2}{2\operatorname{abs}(AB)}.$$
7.18 Solution.

Write the equation of the line that passes through the point $(x_0, y_0)$: $y - y_0 = k(x - x_0)$, where $k$ is the finite slope.

The case $k \to \infty$ (a vertical line) does not require separate consideration, since in that case the line does not intercept a triangle from the quadrantal angle.

Find the coordinates of the points of intersection of the line with the coordinate axes:

with the axis $Ox$: $y = 0$, $-y_0 = k(x^* - x_0)$, $x^* = x_0 - \frac{y_0}{k}$, where $\operatorname{abs}(x^*)$ is the length of the leg lying on the axis $Ox$;

with the axis $Oy$: $x = 0$, $y^* - y_0 = -kx_0$, $y^* = y_0 - kx_0$, where $\operatorname{abs}(y^*)$ is the length of the leg lying on the axis $Oy$.

The area of such a triangle is equal to half of the product of the legs:

$$S = \frac{1}{2}\operatorname{abs}(x^* y^*), \qquad S = \frac{1}{2}\operatorname{abs}\left(\left(x_0 - \frac{y_0}{k}\right)(y_0 - kx_0)\right) = \frac{1}{2}\operatorname{abs}\left(\frac{(y_0 - kx_0)^2}{k}\right).$$

Let us express the variable $k$. For $k < 0$ (the line cuts a triangle off the first quadrant) this relation gives $k^2 x_0^2 - 2k(x_0 y_0 - S) + y_0^2 = 0$. The solution of this quadratic equation leads to two possible values of $k$:

$$k_{1,2} = \frac{x_0 y_0 - S \pm \sqrt{S(S - 2x_0 y_0)}}{x_0^2},$$

which are real only if $S \geqslant 2x_0 y_0$.

We obtain the sought equation of the line:

$$y - y_0 = \frac{x_0 y_0 - S \pm \sqrt{S(S - 2x_0 y_0)}}{x_0^2}\,(x - x_0).$$
7.19 Solution.
As is shown in the previous problem, the area of the triangle $S$ depends on the slope of the line $k$ as follows: $S(k) = \frac{1}{2}\operatorname{abs}\left(\frac{(y_0 - kx_0)^2}{k}\right)$.
For the points $T(x_0, y_0)$ whose coordinates $x_0$ and $y_0$ are positive, the inequality $k < 0$ is fulfilled, i.e. in this case the line forms an obtuse angle with the axis $Ox$.
In order to find the minimal value of the function, let us compute the points at which the derivative

$$\frac{dS(k)}{dk} = \frac{y_0^2 - k^2 x_0^2}{2k^2}$$

is equal to zero.
The condition $k < 0$ is satisfied by the value $k^* = -\frac{y_0}{x_0}$. The point $k^*$ is the minimum point, since the second derivative is

$$\frac{d^2 S(k^*)}{dk^2} = -\frac{y_0^2}{(k^*)^3} > 0.$$

Thus, the least value of the area of the triangle is equal to

$$S_{\min} = S(k^*) = 2x_0 y_0.$$


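7.20 Solution.

Compute the coordinates of the intersection points of the line pairs. The lines with the numbers $i$ and $j$ intersect at the point

$$x_{ij} = \frac{\beta_i \gamma_j - \beta_j \gamma_i}{\alpha_i \beta_j - \alpha_j \beta_i}, \qquad y_{ij} = \frac{\alpha_j \gamma_i - \alpha_i \gamma_j}{\alpha_i \beta_j - \alpha_j \beta_i}.$$

Taking into account that the area of the triangle is expressed as half of the modulus of the vector product of the vectors that form two of its sides, we obtain

$$S = \frac{1}{2}\operatorname{abs}\bigl(x_{12}(y_{23} - y_{31}) + x_{23}(y_{31} - y_{12}) + x_{31}(y_{12} - y_{23})\bigr).$$

Substitute the coordinates of the vertices and reduce the fractions to a common denominator. After simple but somewhat cumbersome algebraic transformations, we arrive at the formula:

$$S = \frac{1}{2}\,\frac{\bigl(\alpha_1(\beta_2 \gamma_3 - \beta_3 \gamma_2) + \alpha_2(\beta_3 \gamma_1 - \beta_1 \gamma_3) + \alpha_3(\beta_1 \gamma_2 - \beta_2 \gamma_1)\bigr)^2}{\operatorname{abs}\bigl((\alpha_1 \beta_2 - \alpha_2 \beta_1)(\alpha_2 \beta_3 - \alpha_3 \beta_2)(\alpha_3 \beta_1 - \alpha_1 \beta_3)\bigr)}.$$

As is easy to see, the obtained expression can be presented as

$$S = \frac{\Delta^2}{2\operatorname{abs}(\Delta_1 \Delta_2 \Delta_3)},$$

where $\Delta = \begin{vmatrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \\ \alpha_3 & \beta_3 & \gamma_3 \end{vmatrix}$, and $\Delta_i$ are the cofactors of the elements $\gamma_i$, $i \in \{1, 2, 3\}$.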


7.21 Solution.

Let us draw a line through the points $T_1$ and $T_2$ (see the formula (7.12)). The slope of this line is $k = 1/2$. The line $T_1T_3$ has the slope $k' = 1/2$. Since $k = k'$ and both lines pass through $T_1$, the points $T_1$, $T_2$ and $T_3$ lie on the same line.


7.22 Solution.

The first method

Let the equation of the line passing through the three points $T_1$, $T_2$ and $T_3$ have the form $y = kx + b$. Having substituted the coordinates of each of these points into the equation of the line, we obtain a system of equations in the unknown variables $k$ and $b$:

$$\begin{cases} b = 1, \\ ak + b = 2, \\ 3k + b = a. \end{cases}$$

The condition of consistency of this system (i.e. the existence of its solution) leads to the quadratic equation $a^2 - a - 3 = 0$.

Therefore, the points $T_1$, $T_2$ and $T_3$ lie on the same line at the two values of the parameter $a$: $a_1 = \frac{1}{2}(1 + \sqrt{13})$ and $a_2 = \frac{1}{2}(1 - \sqrt{13})$.

The second method

Let us use the consequence of the formula (7.44) from Problem 7.4: the criterion for three points to lie on one line is that the third-order determinant is equal to zero:

$$\begin{vmatrix} 0 & 1 & 1 \\ a & 2 & 1 \\ 3 & a & 1 \end{vmatrix} = 0.$$

The obtained equation has two roots: $a_{1,2} = \frac{1}{2}(1 \pm \sqrt{13})$.

Note. The necessary and sufficient condition for the three points $T_1$, $T_2$ and $T_3$ to lie on the same line is collinearity of the vectors $\overrightarrow{T_1T_2}$ and $\overrightarrow{T_1T_3}$. In view of this, there exists one more method of solving this problem, which is based on checking whether the vector product $\overrightarrow{T_1T_2} \times \overrightarrow{T_1T_3}$ is equal to zero. Of course, different solution methods result in the same answer.

7.23 Solution.
The slope of the sought line coincides with the slope of the line $BC$. The equation of the line $BC$ has the form $y = \frac{1}{2}x + 3$ (see the formula (7.12)). Then, use the formula (7.10) for the line that passes through a given point:

$$y - y_1 = k(x - x_1), \tag{7.46}$$

where $k = \frac{1}{2}$, $x_1 = 1$, $y_1 = -1$.
Substituting the numeric values, we obtain the equation of the line that passes through the vertex $A$ parallel to the side $BC$: $x - 2y - 3 = 0$.

7.24 Answer: $x + y - 5 = 0$.


7.25 Solution.
Since the point $N$ lies on the ordinate axis, its coordinates are equal to $(0, y)$, where $y$ is an unknown value. The distance $d$ between the points $N$ and $N'$ is equal to

$$d = \sqrt{(0 - (-2))^2 + (y - (-5/2))^2} = \sqrt{4 + (y + 5/2)^2}.$$

Thus, we obtain the equation in the variable $y$:

$$\sqrt{(0 - (-2))^2 + (y - (-5/2))^2} = 2\sqrt{2},$$

which has the solutions $y_1 = -1/2$ and $y_2 = -9/2$. Therefore, the coordinates of the point $N$ are equal to $(0, -1/2)$ or $(0, -9/2)$.


7.26 Solution.
By the triangle area formula expressed by the coordinates of its vertices $(1, 1)$, $(-2, -3)$ and $(x, 0)$ (see (7.44) on page 290), we obtain $S = \frac{1}{2}\operatorname{abs}(4x - 1)$.
According to the problem statement, $S = 6$, hence $\operatorname{abs}(4x - 1) = 12$, i.e. $x = 13/4$ or $x = -11/4$.
Therefore, the coordinates of the third vertex of the triangle are equal to $(-11/4, 0)$ or $(13/4, 0)$.


7.27 Solution.
The coordinates of the third vertex are $(0, y)$.
The area of the triangle is equal to

$$S = \frac{1}{2}\operatorname{abs}\begin{vmatrix} -2 & 3 & 1 \\ -7 & -1 & 1 \\ 0 & y & 1 \end{vmatrix} = \frac{1}{2}\operatorname{abs}(23 - 5y) = 10,$$

hence $\operatorname{abs}(23 - 5y) = 20$, i.e. $y = 3/5$ or $y = 43/5$.
The coordinates of the third vertex of the triangle are equal to $(0, 3/5)$ or $(0, 43/5)$.
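
Candidate answers to Problems 7.26 and 7.27 can be checked by recomputing the area from the vertex coordinates. This is only a verification sketch; the helper name `triangle_area` is an assumption, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction as F


def triangle_area(p1, p2, p3):
    """Area of a triangle from its vertices via the determinant formula (7.44)."""
    (x1, y1), (x2, y2), (x3, y3) = p1, p2, p3
    return abs((x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)) / F(2)


# Problem 7.26: the third vertex lies on the abscissa axis, required area 6.
for x in (F(13, 4), F(-11, 4)):
    print(x, triangle_area((1, 1), (-2, -3), (x, 0)))

# Problem 7.27: the third vertex lies on the ordinate axis, required area 10.
for y in (F(3, 5), F(43, 5)):
    print(y, triangle_area((-2, 3), (-7, -1), (0, y)))
```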


7.28 Solution.

Let $A(0, 2)$, $B(2, -8)$. The point to be projected is $K(2, -13)$.

The projection of the point is the foot of the perpendicular dropped from $K$ to the line $AB$. Then, the orthogonality property for the line $AB$ and the perpendicular from the point $K$ is fulfilled: $k_1 \cdot k_2 = -1$.

Write the system of equations in the coefficients $k_1$ and $b_1$ for the line $AB$:

$$\begin{cases} 2 = b_1, \\ -8 = 2k_1 + b_1. \end{cases}$$

From this system, we obtain that $b_1 = 2$, $k_1 = -5$. Then, $k_2 = 1/5$.
Substitute $k_2$ into the equation of the line for the perpendicular from the point $K$: $-13 = 2 \cdot 1/5 + b_2$, hence $b_2 = -67/5$.
Compute the coordinates of the point $(x_0, y_0)$ at which the lines intersect: $\frac{1}{5}x_0 - \frac{67}{5} = -5x_0 + 2$, $x_0 = 77/26$, $y_0 = -333/26$.
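
The same projection can be obtained with vectors instead of slopes, which also covers vertical lines. The sketch below swaps in this vector technique for the slope-based method of the solution; the function name `project_point` is hypothetical.

```python
from fractions import Fraction


def project_point(k, a, b):
    """Orthogonal projection of the point k onto the line through a and b."""
    (kx, ky), (ax, ay), (bx, by) = k, a, b
    dx, dy = bx - ax, by - ay            # direction vector of the line AB
    # Parameter of the foot of the perpendicular along AB.
    t = Fraction((kx - ax) * dx + (ky - ay) * dy, dx * dx + dy * dy)
    return (ax + t * dx, ay + t * dy)


x0, y0 = project_point((2, -13), (0, 2), (2, -8))
print(x0, y0)  # 77/26 -333/26
```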


7.29 Solution.

Let $A(1, 2a)$, $B(2, 3a)$, $K(a, a)$.

The projection of a point is the foot of the perpendicular to the line $AB$. Then, the following equality is fulfilled for the line $AB$ and the perpendicular from the point $K$: $k_1 \cdot k_2 = -1$.

Write the system of equations in the coefficients $k_1$ and $b_1$ for the line $AB$:

$$\begin{cases} 2a = k_1 + b_1, \\ 3a = 2k_1 + b_1. \end{cases}$$

From this system, we obtain that $b_1 = a$, $k_1 = a$. Then, $k_2 = -1/a$ (for $a \ne 0$).

Substitute $k_2$ into the equation of the line for the perpendicular from the point $K$: $a = a(-1/a) + b_2$, hence $b_2 = a + 1$.
The abscissa of the projection of the point $K$ satisfies the equation:

$$ax_0 + a = -\frac{1}{a}x_0 + b_2, \quad ax_0 + a = -\frac{x_0}{a} + a + 1, \quad x_0\left(a + \frac{1}{a}\right) = 1, \quad x_0 = \frac{a}{a^2 + 1}.$$

Let us express the ordinate of the projection: $y_0 = -\frac{1}{a^2 + 1} + a + 1 = a + \frac{a^2}{a^2 + 1}$.

The sought projection has the coordinates $\left(\frac{a}{a^2 + 1},\ a + \frac{a^2}{a^2 + 1}\right)$.


7.30 Solution.

The suggested solution generally repeats the approach discussed in Example 7.5. Note that in order to study the full set of possible cases of mutual arrangement of the segment endpoints and the coordinate axes, the integer variable neighbor is additionally used. It plays an important role for the location of the segment illustrated in Fig. 7.8. Here, one of the segment endpoints, namely $P_2$, is located on a coordinate axis, and the variable neighbor receives the number of the quadrant in which the segment points from a small neighborhood of $P_2$ lie.

Fig. 7.8 To Problem 7.30. The segment $P_1P_2$ lies in the second and third quadrants, get_quadrant(p1) = 3, get_quadrant(p2) = 0, neighbor = 2

After all checks are performed, the answer contains the numbers of the quadrants within which the segment points fall.


class Point:
    """A point on the plane with the Cartesian coordinates x and y."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def get_quadrant(p):
    """Return the quadrant number 1..4, or 0 if p lies on a coordinate axis."""
    if p.x > 0 and p.y > 0:
        return 1
    elif p.x < 0 and p.y > 0:
        return 2
    elif p.x < 0 and p.y < 0:
        return 3
    elif p.x > 0 and p.y < 0:
        return 4
    else:
        return 0


def get_quadrants_general(p1, p2):
    """Return the list of quadrants that contain points of the segment p1p2."""
    p1_quad = get_quadrant(p1)
    p2_quad = get_quadrant(p2)
    is_adjacent = abs(p1_quad - p2_quad) % 2 == 1

    if p1_quad == 0 and p2_quad == 0:
        # Both endpoints lie on the axes: the midpoint decides.
        mid = Point((p1.x + p2.x) / 2, (p1.y + p2.y) / 2)
        mid_quad = get_quadrant(mid)
        return [] if mid_quad == 0 else [mid_quad]
    elif p1_quad == 0:
        if p1.x * p2.x < 0:
            # The segment crosses the axis Oy: quadrants 1 <-> 2, 3 <-> 4.
            neighbor = 3 - p2_quad \
                if p2_quad <= 2 else 7 - p2_quad
            return [p2_quad, neighbor]
        if p1.y * p2.y < 0:
            # The segment crosses the axis Ox: quadrants 1 <-> 4, 2 <-> 3.
            neighbor = 5 - p2_quad
            return [p2_quad, neighbor]
        return [p2_quad]
    elif p2_quad == 0:
        if p1.x * p2.x < 0:
            neighbor = 3 - p1_quad if p1_quad <= 2 \
                else 7 - p1_quad
            return [p1_quad, neighbor]
        if p1.y * p2.y < 0:
            neighbor = 5 - p1_quad
            return [p1_quad, neighbor]
        return [p1_quad]
    elif is_adjacent:
        return [p1_quad, p2_quad]
    elif p1_quad == p2_quad:
        return [p1_quad]
    else:
        # Opposite quadrants: b is the ordinate of the intersection
        # of the line p1p2 with the axis Oy.
        b = (p1.y * p2.x - p1.x * p2.y) \
            / (p2.x - p1.x)
        if b == 0:
            # The segment passes through the origin of coordinates.
            return [p1_quad, p2_quad]
        elif p1_quad in (1, 3):
            quadrant = 2 if b > 0 else 4
            return [p1_quad, p2_quad, quadrant]
        else:
            quadrant = 1 if b > 0 else 3
            return [p1_quad, p2_quad, quadrant]


Chapter 8
Equation of a Plane in Space


8.1 Equation of a Plane That Is Orthogonal to the Specified Vector and Passes Through the Specified Point


Assume that it is known that the plane $\pi$ is orthogonal to the vector $\mathbf{n} = (A, B, C)$ and passes through the point $T_0(x_0, y_0, z_0)$. Take an arbitrary point $T(x, y, z)$ on the plane $\pi$. The vector $\overrightarrow{T_0T}$ belongs to the plane $\pi$. Since the vector $\mathbf{n}$ is orthogonal to the plane $\pi$, it follows that the vector

$$\overrightarrow{T_0T} = (x - x_0,\ y - y_0,\ z - z_0) \tag{8.1}$$

is orthogonal to the vector $\mathbf{n}$.
Relying on the property of the scalar product of orthogonal vectors, we can write

$$\mathbf{n} \cdot \overrightarrow{T_0T} = 0. \tag{8.2}$$

This equation is called the vector equation of a plane [8].
Rewritten in coordinate form

$$A(x - x_0) + B(y - y_0) + C(z - z_0) = 0, \tag{8.3}$$

this equation is called the equation of a plane that is orthogonal to the vector $\mathbf{n} = (A, B, C)$ and passes through the point $T_0(x_0, y_0, z_0)$.


8.2 General Equation of a Plane


The equation of the first degree

$$Ax + By + Cz + D = 0, \tag{8.4}$$

in which $A$, $B$, $C$ and $D$ are arbitrary real constants such that at least one of the coefficients $A$, $B$ and $C$ is other than zero, is referred to as the general equation of a plane.

A general equation (8.4) is referred to as complete if all its coefficients are other than zero. If at least one of the mentioned coefficients is equal to zero, the equation is referred to as incomplete.

Consider the possible types of incomplete equations.

(1) If $D = 0$, then the equation $Ax + By + Cz = 0$ determines the plane that passes through the origin of coordinates.

(2) If $A = 0$, then the equation $By + Cz + D = 0$ determines the plane parallel to the axis $Ox$, since the normal vector of this plane $\mathbf{n} = (0, B, C)$ is perpendicular to the axis $Ox$.

(3) If $B = 0$, then the equation $Ax + Cz + D = 0$ determines the plane parallel to the axis $Oy$, since its normal vector $\mathbf{n} = (A, 0, C)$ is perpendicular to the axis $Oy$.

(4) If $C = 0$, then the equation $Ax + By + D = 0$ determines the plane parallel to the axis $Oz$, since the normal vector with the coordinates $(A, B, 0)$ is perpendicular to this axis.

(5) If $A = 0$ and $B = 0$, then the equation $Cz + D = 0$ determines the plane parallel to the coordinate plane $xOy$.

(6) If $A = 0$ and $C = 0$, then the equation $By + D = 0$ determines the plane parallel to the coordinate plane $xOz$.

(7) If $B = 0$ and $C = 0$, then the equation $Ax + D = 0$ determines the plane parallel to the coordinate plane $yOz$.

(8) If $A = 0$, $B = 0$ and $D = 0$, then the equation of the plane $Cz = 0$ determines the coordinate plane $xOy$.

(9) If $A = 0$, $C = 0$ and $D = 0$, then the equation of the plane $By = 0$ determines the coordinate plane $xOz$.

(10) If $B = 0$, $C = 0$ and $D = 0$, then the equation of the plane $Ax = 0$ determines the coordinate plane $yOz$.


8.3 Intercept Form of the Equation of a Plane


Consider the general equation of the plane $Ax + By + Cz + D = 0$.
Assume that all the coefficients $A$, $B$, $C$ and $D$ are other than zero. Then this equation can be written in the form:

$$\frac{x}{(-D/A)} + \frac{y}{(-D/B)} + \frac{z}{(-D/C)} = 1. \tag{8.5}$$

Let us introduce the notations: $a = -\frac{D}{A}$, $b = -\frac{D}{B}$, $c = -\frac{D}{C}$. Then Eq. (8.5) will be reduced to the following form:

$$\frac{x}{a} + \frac{y}{b} + \frac{z}{c} = 1. \tag{8.6}$$

This is the intercept form of the equation of a plane.

In Eq. (8.6) the numbers $a$, $b$ and $c$ have a simple geometric meaning: they are equal in absolute value to the lengths of the segments (intercepts) that the plane cuts off on the coordinate axes $Ox$, $Oy$ and $Oz$, respectively (see Fig. 8.1). The plane passes through the points $(a, 0, 0)$, $(0, b, 0)$, $(0, 0, c)$.

Fig. 8.1 The intercepts cut off by the plane on the coordinate axes


8.4 Normal Equation of a Plane


Consider the plane shown in Fig. 8.2.

Fig. 8.2 To the derivation of the normal equation of a plane

Assume that $p$ is the length of the vector $\overrightarrow{OP}$ drawn from the origin perpendicular to the plane; $\alpha$, $\beta$, $\gamma$ are the angles between the unit vector $\mathbf{n}$ and the coordinate axes; $Q$ is an arbitrary point on the plane with the coordinates $(x, y, z)$.

It is obvious that the projection of the vector $\overrightarrow{OQ}$ on the direction $\mathbf{n}$ is equal to $p$:

$$\operatorname{Pr}_{\mathbf{n}}\overrightarrow{OQ} = p, \tag{8.7}$$

$$\operatorname{Pr}_{\mathbf{n}}\overrightarrow{OQ} = \frac{\overrightarrow{OQ} \cdot \mathbf{n}}{|\mathbf{n}|}. \tag{8.8}$$

Therefore

$$\overrightarrow{OQ} \cdot \mathbf{n} = x\cos\alpha + y\cos\beta + z\cos\gamma = p. \tag{8.9}$$

We have obtained the normal equation of a plane. The values $\cos\alpha$, $\cos\beta$, $\cos\gamma$ are called the direction cosines of the vector $\mathbf{n}$.
Take the general equation of a plane

$$Ax + By + Cz + D = 0, \tag{8.10}$$

where $D \ne 0$.
The normalizing factor is calculated by the formula:

$$\mu = -\frac{1}{\sqrt{A^2 + B^2 + C^2}}\operatorname{sgn}(D) \tag{8.11}$$

(cf. the formula (7.32)).
If we multiply (8.10) by the normalizing factor $\mu$, then as a result we obtain the normal equation of the plane:

$$\mu(Ax + By + Cz + D) = 0. \tag{8.12}$$

Note. If the condition $D = 0$ is met, then the plane passes through the origin of coordinates, and the normalizing factor can be taken with an arbitrary sign: $\mu = \pm\frac{1}{\sqrt{A^2 + B^2 + C^2}}$.


8.5 Equation of a Plane That Passes Through the Specified Point Parallel to the Two Specified Vectors


Let the plane $\pi$ be parallel to the vectors $\mathbf{a}_1 = (k_1, l_1, m_1)$ and $\mathbf{a}_2 = (k_2, l_2, m_2)$ and pass through the point $T_0(x_0, y_0, z_0)$. Further, let $T(x, y, z)$ be an arbitrary point on the plane. Then the vectors $\mathbf{a}_1$, $\mathbf{a}_2$ and $\overrightarrow{T_0T}$ are coplanar. Therefore,

$$(\overrightarrow{T_0T} \times \mathbf{a}_1) \cdot \mathbf{a}_2 = 0, \tag{8.13}$$

$$\begin{vmatrix} x - x_0 & y - y_0 & z - z_0 \\ k_1 & l_1 & m_1 \\ k_2 & l_2 & m_2 \end{vmatrix} = 0 \tag{8.14}$$

or

$$(x - x_0)\begin{vmatrix} l_1 & m_1 \\ l_2 & m_2 \end{vmatrix} - (y - y_0)\begin{vmatrix} k_1 & m_1 \\ k_2 & m_2 \end{vmatrix} + (z - z_0)\begin{vmatrix} k_1 & l_1 \\ k_2 & l_2 \end{vmatrix} = 0. \tag{8.15}$$

After expanding the second-order determinants and introducing the appropriate notation, we obtain the general equation of the plane.


8.6 Equation of the Plane That Passes Through the Three Specified Points


Given are the following points: $T_1(x_1, y_1, z_1)$, $T_2(x_2, y_2, z_2)$ and $T_3(x_3, y_3, z_3)$.

Take an arbitrary point $T(x, y, z)$ of the plane and construct the vectors $\overrightarrow{T_1T}$, $\overrightarrow{T_1T_2}$ and $\overrightarrow{T_1T_3}$. They are coplanar, therefore

$$\begin{vmatrix} x - x_1 & y - y_1 & z - z_1 \\ x_2 - x_1 & y_2 - y_1 & z_2 - z_1 \\ x_3 - x_1 & y_3 - y_1 & z_3 - z_1 \end{vmatrix} = 0. \tag{8.16}$$

The obtained equation is the equation of a plane that passes through the three specified points.
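
Expanding the determinant (8.16) along its first row gives the coefficients of the general equation; they are the coordinates of the vector product of the two side vectors. A minimal sketch of this computation follows; the function name `plane_through_points` is an illustrative assumption.

```python
def plane_through_points(t1, t2, t3):
    """Coefficients (A, B, C, D) of the plane through three points in space."""
    u = [t2[i] - t1[i] for i in range(3)]   # vector T1T2
    v = [t3[i] - t1[i] for i in range(3)]   # vector T1T3
    # Cofactors of the first row of the determinant (8.16).
    a = u[1] * v[2] - u[2] * v[1]
    b = u[2] * v[0] - u[0] * v[2]
    c = u[0] * v[1] - u[1] * v[0]
    if a == 0 and b == 0 and c == 0:
        raise ValueError("the points lie on one line: the plane is not unique")
    d = -(a * t1[0] + b * t1[1] + c * t1[2])
    return a, b, c, d


print(plane_through_points((1, 0, 0), (0, 1, 0), (0, 0, 1)))  # (1, 1, 1, -1)
```

The printed result corresponds to the plane $x + y + z - 1 = 0$, whose intercepts on all three axes are equal to 1.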


8.7 Angle Between Two Planes


Consider the planes specified by the equations:

$$A_1 x + B_1 y + C_1 z + D_1 = 0, \tag{8.17}$$

$$A_2 x + B_2 y + C_2 z + D_2 = 0. \tag{8.18}$$

For the given planes, construct the normal vectors $\mathbf{n}_1 = (A_1, B_1, C_1)$ and $\mathbf{n}_2 = (A_2, B_2, C_2)$. The angle $\omega$ between the planes is then determined from the relation:

$$\cos\omega = \frac{\mathbf{n}_1 \cdot \mathbf{n}_2}{|\mathbf{n}_1||\mathbf{n}_2|} = \frac{A_1 A_2 + B_1 B_2 + C_1 C_2}{\sqrt{A_1^2 + B_1^2 + C_1^2}\,\sqrt{A_2^2 + B_2^2 + C_2^2}}. \tag{8.19}$$

If $\mathbf{n}_1 \cdot \mathbf{n}_2 = 0$, then the vectors $\mathbf{n}_1$ and $\mathbf{n}_2$ are orthogonal. Hence,

$$A_1 A_2 + B_1 B_2 + C_1 C_2 = 0 \tag{8.20}$$

is the condition of orthogonality of planes.

But if the equalities $\frac{A_1}{A_2} = \frac{B_1}{B_2} = \frac{C_1}{C_2}$ are valid, then the vectors $\mathbf{n}_1$ and $\mathbf{n}_2$ are collinear, and this is the condition of parallelism of planes.


8.8 Distance from a Point to a Plane

The concept of deviation of the point $T_0(x_0, y_0, z_0)$ from the plane $Ax + By + Cz + D = 0$ is introduced similarly to the relation (7.37):

$$\delta = -\operatorname{sgn}(D)\,\frac{Ax_0 + By_0 + Cz_0 + D}{\sqrt{A^2 + B^2 + C^2}}. \tag{8.21}$$

The distance $d$ from the point $T_0$ to the plane is determined as the absolute value of the deviation: $d = \operatorname{abs}(\delta)$.
If the plane is specified in the normal form, then the distance $d$ is determined as

$$d = \operatorname{abs}(\cos\alpha \cdot x_0 + \cos\beta \cdot y_0 + \cos\gamma \cdot z_0 - p), \tag{8.22}$$

where $\cos\alpha$, $\cos\beta$, $\cos\gamma$ are the direction cosines.
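
The deviation (8.21) translates into a few lines of code. The sketch below assumes the sign convention of (8.11); the function name `plane_deviation` is hypothetical, and the sample data are taken from Problem 8.7.

```python
import math


def plane_deviation(a, b, c, d, point):
    """Signed deviation of a point from the plane a*x + b*y + c*z + d = 0."""
    x0, y0, z0 = point
    # Sign opposite to that of d; for d == 0 the sign may be chosen arbitrarily.
    sign = -1.0 if d > 0 else 1.0
    return sign * (a * x0 + b * y0 + c * z0 + d) / math.sqrt(a * a + b * b + c * c)


dev = plane_deviation(1, 1, -2, -17, (1, 2, 9))
print(abs(dev))  # the distance, equal to 32 / sqrt(6)
```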


8.9 Pencil of Planes


Assume that two planes are specified by the equations:

$$\begin{cases} A_1 x + B_1 y + C_1 z + D_1 = 0, \\ A_2 x + B_2 y + C_2 z + D_2 = 0. \end{cases} \tag{8.23}$$

If these planes are neither parallel nor coincident, then they intersect along some straight line.

It is obvious that for any real constants $\lambda$ and $\mu$ that are not both equal to zero, the plane determined by the equation

$$\lambda(A_1 x + B_1 y + C_1 z + D_1) + \mu(A_2 x + B_2 y + C_2 z + D_2) = 0 \tag{8.24}$$

will also pass through this line, because all the points that satisfy Eqs. (8.23) satisfy Eq. (8.24) as well. The same equation specifies all the planes that pass through the common line.

A collection of planes passing through the same straight line is called the pencil of planes.


Example 8.1 Find the equation of the plane that passes through the point $T(4, -1, 2)$ and the straight line that is the intersection of the planes $x + 3y - z - 5 = 0$ and $-2x + y + z + 4 = 0$.

Solution Write the equation of the pencil of planes and fit $\lambda$ and $\mu$ so that the required plane passes through the point $T$:

$$\lambda(x + 3y - z - 5) + \mu(-2x + y + z + 4) = 0, \tag{8.25}$$

$$\lambda(4 + 3 \cdot (-1) - 2 - 5) + \mu(-2 \cdot 4 + (-1) + 2 + 4) = 0, \tag{8.26}$$

$$-6\lambda - 3\mu = 0, \tag{8.27}$$

$$2\lambda + \mu = 0. \tag{8.28}$$

For example, take the values $\lambda = 1$ and $\mu = -2$. In this case, we obtain the answer:

$$(x + 3y - z - 5) + (-2)(-2x + y + z + 4) = 0, \tag{8.29}$$

or

$$5x + y - 3z - 13 = 0. \tag{8.30}$$
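
The choice of $\lambda$ and $\mu$ in Example 8.1 can be automated: if $f_1$ and $f_2$ are the values of the left-hand sides of the two plane equations at the given point, then $\lambda = f_2$, $\mu = -f_1$ satisfy $\lambda f_1 + \mu f_2 = 0$. The sketch below rests on that observation; the function name `pencil_plane_through` and the tuple representation `(A, B, C, D)` are assumptions.

```python
def pencil_plane_through(p1, p2, point):
    """Plane of the pencil defined by planes p1 and p2 that contains the point."""
    x, y, z = point
    f1 = p1[0] * x + p1[1] * y + p1[2] * z + p1[3]
    f2 = p2[0] * x + p2[1] * y + p2[2] * z + p2[3]
    if f1 == 0 and f2 == 0:
        raise ValueError("the point lies on the common line: any plane fits")
    lam, mu = f2, -f1                      # lam * f1 + mu * f2 == 0
    return tuple(lam * c1 + mu * c2 for c1, c2 in zip(p1, p2))


print(pencil_plane_through((1, 3, -1, -5), (-2, 1, 1, 4), (4, -1, 2)))
# (-15, -3, 9, 39), i.e. -3 times the coefficients of 5x + y - 3z - 13 = 0
```

The result differs from (8.30) only by the common factor $-3$, which does not change the plane.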


Review Questions


1. Write the equation of a plane orthogonal to a specified vector and passing through a specified point.

2. Define the general equation of a plane.

3. Write the intercept form of the equation of a plane.

4. For the solution of what problem is it convenient to use the normal form of the equation of a plane?

5. How are the direction cosines of the vector $\mathbf{a}$ computed?

6. Write the equation of a plane that passes through a specified point parallel to two specified vectors.

7. Write the equation of a plane that passes through three specified points.

8. How is the angle between two planes computed?

9. Define the deviation of an arbitrary point from a given plane.

10. What is a pencil of planes?

Problems


8.1. Set up the equation of the plane that passes through the origin of coordinates, if it has the normal vector $\mathbf{n} = (1, 2, -3)$.

8.2. Set up the equation of the plane that passes through the point $T(-1, 0, 2)$ and has the normal vector $\mathbf{n} = (-3, -2, 0)$.

8.3. Find the equation of the plane that passes through the points $T_1(2, -1, 0)$ and $T_2(-5, 1, 1)$ parallel to the vector $\mathbf{a} = (0, -1, -7)$.

8.4. Find the equation of the plane that passes through the point $T_0(2, -1, 0)$ parallel to the vectors $\mathbf{a}_1 = (3, 5, -8)$ and $\mathbf{a}_2 = (4, 6, -7)$.

8.5. Find the equation of the plane that passes through the point $T_0(1/2, 1/2, 1/2)$ parallel to the vectors $\mathbf{a}_1 = (0, 1, -1)$ and $\mathbf{a}_2 = (1, 1, 10)$.

8.6. Find the lengths of the segments intercepted by the plane $3x + 4y + 5z - 12 = 0$ on the coordinate axes.

8.7. What is the distance from the point $A(1, 2, 9)$ to the plane $x + y - 2z - 17 = 0$?

8.8. Does the plane $-2x + 2y - z - 1 = 0$ intersect the segment $P_1P_2$, if the segment endpoint coordinates are as follows: $P_1(-5, -5, -5)$, $P_2(8, 8, 8)$?

8.9. Compute the distance between the parallel planes specified by the equations $Ax + By + Cz + D = 0$ and $Ax + By + Cz + D' = 0$, where $D \ne D'$.

*8.10. On what condition do the three planes $A_1 x + B_1 y + C_1 z + D_1 = 0$, $A_2 x + B_2 y + C_2 z + D_2 = 0$ and $A_3 x + B_3 y + C_3 z + D_3 = 0$ intersect at exactly one point? Find the coordinates of this point.

8.11. Find the volume of the tetrahedron intercepted by the plane $Ax + By + Cz + D = 0$ from the quadrantal angle.

8.12. On the axis $Oz$ find the points equidistant from the two planes $x - y + z - 10 = 0$ and $x + y - z + 8 = 0$.

*8.13. Compute the volume of a tetrahedron whose vertices are located at the points with the coordinates $(x_1, y_1, z_1)$, $(x_2, y_2, z_2)$, $(x_3, y_3, z_3)$ and $(x_4, y_4, z_4)$.


Answers and Solutions


8.1 Solution.
Use the formula (8.3) that expresses the equation of the plane that passes through the point $A_0(x_0, y_0, z_0)$ perpendicular to the vector $\mathbf{n} = (n_1, n_2, n_3)$:

$$n_1(x - x_0) + n_2(y - y_0) + n_3(z - z_0) = 0.$$

Substitute the coordinates of the point $O(0, 0, 0)$: $(x - 0) + 2(y - 0) - 3(z - 0) = 0$.
Hence, the equation of the sought plane has the form $x + 2y - 3z = 0$.


8.2 Solution.
Substitute into the formula (8.3) the coordinates of the point $T(-1, 0, 2)$ and the vector $\mathbf{n} = (-3, -2, 0)$; we obtain the equation of the plane:

$$(-3)(x - (-1)) + (-2)(y - 0) + 0 \cdot (z - 2) = 0,$$

or

$$3x + 2y + 3 = 0.$$


8.3 Solution.
Select an arbitrary point $T(x, y, z)$ on the sought plane. According to the problem statement, the vectors $\overrightarrow{T_1T}$, $\overrightarrow{T_1T_2}$ and $\mathbf{a}$ are coplanar, therefore, their scalar triple product is equal to zero:

$$(\overrightarrow{T_1T}, \overrightarrow{T_1T_2}, \mathbf{a}) = 0.$$

Use the formula (6.33):

$$(\overrightarrow{T_1T}, \overrightarrow{T_1T_2}, \mathbf{a}) = \begin{vmatrix} x - 2 & y + 1 & z \\ -7 & 2 & 1 \\ 0 & -1 & -7 \end{vmatrix} = -13x - 49y + 7z - 23 = 0.$$

Then, the equation of the plane that passes through the points $T_1$ and $T_2$ parallel to the vector $\mathbf{a}$ can be presented in the form $13x + 49y - 7z + 23 = 0$.


8.4 Solution.
Consider an arbitrary point $T(x, y, z)$ on the sought plane. The scalar triple product $(\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_0T})$ is equal to zero because of coplanarity of these three vectors.
Having computed the scalar triple product, we obtain

$$(\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_0T}) = \begin{vmatrix} 3 & 5 & -8 \\ 4 & 6 & -7 \\ x - 2 & y + 1 & z \end{vmatrix} = 13x - 11y - 2z - 37 = 0.$$

Hence, the equation of the plane that passes through the point $T_0$ parallel to the vectors $\mathbf{a}_1$ and $\mathbf{a}_2$ can be presented in the form $13x - 11y - 2z - 37 = 0$.


8.5 Solution. 
Using an auxiliary point $T(x, y, z)$, similarly to the
solution of the previous problem, we obtain

\[ (\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_0T}) = 0, \]

\[ \begin{vmatrix} 0 & 1 & -1 \\ 1 & 1 & 10 \\ x - \frac{1}{2} & y - \frac{1}{2} & z - \frac{1}{2} \end{vmatrix} = 11x - y - z - \frac{9}{2} = 0. \]

As a result, the equation of the plane that passes through the point $T_0$ parallel to the
vectors $\mathbf{a}_1$ and $\mathbf{a}_2$ has the form $22x - 2y - 2z - 9 = 0$.
8.6 Solution.

Assume that the general equation of the plane $Ax + By + Cz + D = 0$ is specified,
and $D \neq 0$. Pass to the intercept form of the equation of a plane by dividing both
sides of the equation by $-D$:

\[ -\frac{A}{D}x - \frac{B}{D}y - \frac{C}{D}z = 1. \]

The quantities $a = -\frac{D}{A}$, $b = -\frac{D}{B}$ and $c = -\frac{D}{C}$ are equal in absolute value to the lengths
of the segments intercepted by the plane on the coordinate axes Ox, Oy and Oz,
respectively (see formula (8.6)).
Having substituted the values of the coefficients $A = 3$, $B = 4$, $C = 5$, $D = -12$,
we obtain $a = 4$, $b = 3$, $c = \frac{12}{5}$.


8.7 Solution. 
Write the equation of the plane in normal form:

\[ \frac{1}{\sqrt{1^2 + 1^2 + (-2)^2}}(x + y - 2z - 17) = \frac{1}{\sqrt{6}}(x + y - 2z - 17) = 0. \]


Substitute the coordinates of the point from the problem statement into the normal 
equation and find the distance from this point to the plane: 


1 32 
d = abs (a+ 18 1) ie 
8.8 Solution. 

The plane divides space into two parts: $-2x + 2y - z - 1 > 0$ and $-2x + 2y - z - 1 < 0$.
If the two endpoints of the segment are located in different parts of space,
then the segment intersects the plane. Substitute the coordinates of
the points from the problem statement:

\[ -2 \cdot (-5) + 2 \cdot (-5) - (-5) - 1 = 4 > 0, \]
\[ -2 \cdot 8 + 2 \cdot 8 - 8 - 1 = -9 < 0. \]

Therefore, the points $P_1$ and $P_2$ are situated on opposite sides of the plane, and
the segment $P_1P_2$ intersects the plane $-2x + 2y - z - 1 = 0$.


8.9 Solution. 
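Find the deviation of each plane from the origin of coordinates:

\[ \delta_1 = -\frac{1}{\sqrt{A^2 + B^2 + C^2}}\, D \operatorname{sgn}(D), \qquad \delta_2 = -\frac{1}{\sqrt{A^2 + B^2 + C^2}}\, D' \operatorname{sgn}(D'). \]

Then the distance $d$ between the parallel planes is equal to the absolute value of
the difference of these deviations:

\[ d = \operatorname{abs}(\delta_1 - \delta_2) = \operatorname{abs}\left( -\frac{1}{\sqrt{A^2 + B^2 + C^2}}\, D \operatorname{sgn}(D) + \frac{1}{\sqrt{A^2 + B^2 + C^2}}\, D' \operatorname{sgn}(D') \right) = \frac{\operatorname{abs}(D' - D)}{\sqrt{A^2 + B^2 + C^2}}. \]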


8.10 Answer: 
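The condition for three planes to intersect at one point is that the determinant is
other than zero:

\[ \Delta = \begin{vmatrix} A_1 & B_1 & C_1 \\ A_2 & B_2 & C_2 \\ A_3 & B_3 & C_3 \end{vmatrix} \neq 0. \]

The coordinates $(x_P, y_P, z_P)$ of the intersection point of these planes are

\[ x_P = -\frac{1}{\Delta}\begin{vmatrix} D_1 & B_1 & C_1 \\ D_2 & B_2 & C_2 \\ D_3 & B_3 & C_3 \end{vmatrix}, \quad y_P = -\frac{1}{\Delta}\begin{vmatrix} A_1 & D_1 & C_1 \\ A_2 & D_2 & C_2 \\ A_3 & D_3 & C_3 \end{vmatrix}, \quad z_P = -\frac{1}{\Delta}\begin{vmatrix} A_1 & B_1 & D_1 \\ A_2 & B_2 & D_2 \\ A_3 & B_3 & D_3 \end{vmatrix}. \]

Note that if $\Delta = 0$ and at least one second order minor of the matrix

\[ \begin{pmatrix} A_1 & B_1 & C_1 \\ A_2 & B_2 & C_2 \\ A_3 & B_3 & C_3 \end{pmatrix} \]

is other than zero, then all planes are parallel to the same line. But
if all the second order minors are zeroes, then the planes have the common line.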
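The determinant formulas of Answer 8.10 can be checked numerically. The following is a minimal Python sketch rather than code from the text: the names `det3` and `intersect_three_planes` and the tuple layout `(A, B, C, D)` are assumptions made for illustration, and exact rational arithmetic from the standard `fractions` module is used to avoid rounding.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def intersect_three_planes(planes):
    """planes: three tuples (A, B, C, D) for the planes Ax + By + Cz + D = 0.

    Returns the common point (x_P, y_P, z_P), or None when Delta = 0.
    """
    rows = [[Fraction(v) for v in p] for p in planes]
    delta = det3([r[:3] for r in rows])
    if delta == 0:
        return None
    point = []
    for col in range(3):
        # Replace column `col` of the coefficient matrix by the free terms D_i.
        m = [r[:3] for r in rows]
        for i in range(3):
            m[i][col] = rows[i][3]
        point.append(-det3(m) / delta)
    return tuple(point)
```

For the planes x = 1, y = 2 and z = 3, written as `(1, 0, 0, -1)`, `(0, 1, 0, -2)` and `(0, 0, 1, -3)`, the function returns the point (1, 2, 3).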


8.11 Solution. 
Pass to the intercept form of the equation of a plane:

\[ -\frac{A}{D}x - \frac{B}{D}y - \frac{C}{D}z = 1, \]

\[ -\frac{D}{A} = a, \quad -\frac{D}{B} = b, \quad -\frac{D}{C} = c, \]

where $a$, $b$ and $c$ are equal in absolute value to the lengths of the segments intercepted on the
coordinate axes by the plane and coinciding with the edges of the tetrahedron. The
volume of the tetrahedron $V_{\text{tetr}}$ and the volume of the parallelepiped $V_{\text{par}}$ are related
as follows:

\[ V_{\text{tetr}} = \frac{1}{6} V_{\text{par}}, \quad \text{where } V_{\text{par}} = \operatorname{abs}(abc). \]

Substitute the values of $a$, $b$ and $c$:

\[ V_{\text{tetr}} = \frac{1}{6} \operatorname{abs}\left( \frac{D^3}{ABC} \right). \]
8.12 Solution. 


Find the normal equations of the planes:

\[ \frac{1}{\sqrt{3}}(x - y + z - 10) = 0, \]
\[ -\frac{1}{\sqrt{3}}(x + y - z + 8) = 0. \]

The coordinates of a point located on the axis Oz are $(0, 0, z_0)$, where $z_0 \in \mathbb{R}$. In
order to find the distances from this point to the planes, substitute the coordinates into
Eqs. (8.22):

\[ \frac{\operatorname{abs}(z_0 - 10)}{\sqrt{3}} = d_1, \qquad \frac{\operatorname{abs}(-z_0 + 8)}{\sqrt{3}} = d_2. \]

Equate the distances:

\[ \operatorname{abs}(z_0 - 10) = \operatorname{abs}(-z_0 + 8) \;\Rightarrow\; z_0 = 9. \]

So, the problem statement is satisfied by the point with the coordinates $(0, 0, 9)$.


8.13 Solution. 
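Denote the vertices of the tetrahedron $(x_i, y_i, z_i)$ by $A_i$, respectively ($i = 1, 2, 3, 4$).
The volume of the tetrahedron is one sixth of the absolute value
of the scalar triple product $(\overrightarrow{A_1A_2}, \overrightarrow{A_1A_3}, \overrightarrow{A_1A_4})$:

\[ V = \frac{1}{6} \operatorname{abs}(\overrightarrow{A_1A_2}, \overrightarrow{A_1A_3}, \overrightarrow{A_1A_4}) = \frac{1}{6} \operatorname{abs} \begin{vmatrix} x_2 - x_1 & y_2 - y_1 & z_2 - z_1 \\ x_3 - x_1 & y_3 - y_1 & z_3 - z_1 \\ x_4 - x_1 & y_4 - y_1 & z_4 - z_1 \end{vmatrix}. \]

Note that this expression can be rewritten in equivalent form:

\[ V = \frac{1}{6} \operatorname{abs} \begin{vmatrix} x_1 & y_1 & z_1 & 1 \\ x_2 & y_2 & z_2 & 1 \\ x_3 & y_3 & z_3 & 1 \\ x_4 & y_4 & z_4 & 1 \end{vmatrix}. \]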
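The volume formula of Solution 8.13 is easy to try out. The sketch below is an illustration only: the function names `det3` and `tetrahedron_volume` are hypothetical, and vertices are assumed to be passed as triples of integers or rationals.

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def tetrahedron_volume(a1, a2, a3, a4):
    """One sixth of the absolute value of the triple product (A1A2, A1A3, A1A4)."""
    edges = [[Fraction(q) - Fraction(p) for p, q in zip(a1, v)] for v in (a2, a3, a4)]
    return abs(det3(edges)) / 6
```

For the vertices (0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1) the result is 1/6, and four coplanar points give zero volume.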


Chapter 9
Equation of a Line in a Space


9.1 Equation of a Line That Passes Through the Specified 
Point Parallel to the Specified Vector 


Assume that the line $L$ passes through the point $T_0(x_0, y_0, z_0)$ parallel to the vector
$\mathbf{a} = (k, l, m)$. Take an arbitrary point $T(x, y, z)$ on the line and construct the vector
$\overrightarrow{T_0T} = (x - x_0, y - y_0, z - z_0)$, parallel to the line $L$ and collinear with the vector
$\mathbf{a}$. Then, we can write the equation

\[ \frac{x - x_0}{k} = \frac{y - y_0}{l} = \frac{z - z_0}{m}. \tag{9.1} \]

This relation is referred to as the canonical equation of a line that passes
through the specified point parallel to the specified vector, and the vector
$\mathbf{a} = (k, l, m)$ is referred to as the directing vector.

From the canonical equation (9.1), we can easily derive the parametric equation
of such a line:

\[ \begin{cases} x - x_0 = ku, \\ y - y_0 = lu, \\ z - z_0 = mu, \end{cases} \tag{9.2} \]

where $u \in \mathbb{R}$ is the parameter of the line.


Example 9.1 Let us find the equation of the line that passes through the point
$T(3, -1, 0)$ perpendicular to the plane $x - 4y + 7z - 10 = 0$.

Solution The normal vector of the plane $\mathbf{n} = (1, -4, 7)$ will in this case coincide
with the directing vector of the line:

\[ \frac{x - 3}{1} = \frac{y + 1}{-4} = \frac{z}{7}. \tag{9.3} \]


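Example 9.2 Let us find the canonical equation of the line that is the intersection of
the planes determined by the equations $-x - y + z + 2 = 0$ and $2x + 4y - z = 0$.

Solution The line is orthogonal to the normal vectors of each of the planes:
$\mathbf{n}_1 = (-1, -1, 1)$, $\mathbf{n}_2 = (2, 4, -1)$. Therefore, as the directing vector of the line, we can
take $\mathbf{a} = \mathbf{n}_1 \times \mathbf{n}_2$:

\[ \mathbf{a} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ -1 & -1 & 1 \\ 2 & 4 & -1 \end{vmatrix} = -3\mathbf{i} + \mathbf{j} - 2\mathbf{k}, \quad \mathbf{a} = (-3, 1, -2). \tag{9.4} \]

As the coordinates of a point that lies on the line, we select any solution of the
system

\[ \begin{cases} -x - y + z + 2 = 0, \\ 2x + 4y - z = 0. \end{cases} \tag{9.5} \]

Let $z = 0$, then

\[ \begin{cases} -x - y = -2, \\ 2x + 4y = 0 \end{cases} \;\Rightarrow\; x = 4, \; y = -2. \tag{9.6} \]

Therefore, the point $T(4, -2, 0)$ lies on the line,
and the equation of the sought line has the form:

\[ \frac{x - 4}{-3} = \frac{y + 2}{1} = \frac{z}{-2}. \tag{9.7} \]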
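The two steps of Example 9.2 (a cross product for the direction, then a point with z = 0) can be sketched in Python. This is an illustrative sketch, not the book's own listing: `cross` and `intersection_line` are hypothetical names, planes are assumed to be given as tuples `(A, B, C, D)`, and the choice z = 0 is assumed to give a uniquely solvable 2x2 system.

```python
from fractions import Fraction


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def intersection_line(plane1, plane2):
    """Return (point, direction) of the line in which two planes intersect."""
    a1, b1, c1, d1 = (Fraction(v) for v in plane1)
    a2, b2, c2, d2 = (Fraction(v) for v in plane2)
    direction = cross((a1, b1, c1), (a2, b2, c2))
    # With z = 0 the system becomes a1 x + b1 y = -d1, a2 x + b2 y = -d2.
    det = a1 * b2 - a2 * b1
    if det == 0:
        raise ValueError("setting z = 0 does not give a unique point here")
    x = (-d1 * b2 + d2 * b1) / det
    y = (-a1 * d2 + a2 * d1) / det
    return (x, y, Fraction(0)), direction
```

Called with the planes of Example 9.2, `intersection_line((-1, -1, 1, 2), (2, 4, -1, 0))` reproduces the point (4, -2, 0) and the directing vector (-3, 1, -2).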
9.2 Equation of a Line That Passes Through the Two Specified Points


Assume that the line $L$ passes through the points $T_1(x_1, y_1, z_1)$ and $T_2(x_2, y_2, z_2)$.
Then, the vector $\mathbf{a} = \overrightarrow{T_1T_2} = (x_2 - x_1, y_2 - y_1, z_2 - z_1)$ is parallel to the line $L$, and
the equation of a line that passes through the two specified points in canonical
form looks as follows:

\[ \frac{x - x_1}{x_2 - x_1} = \frac{y - y_1}{y_2 - y_1} = \frac{z - z_1}{z_2 - z_1}. \tag{9.8} \]

Example 9.3 The line that passes through two points with the coordinates
$(11, 2, -6)$ and $(13, 0, 7)$ is represented by the equation:

\[ \frac{x - 11}{2} = \frac{y - 2}{-2} = \frac{z + 6}{13}. \tag{9.9} \]

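9.3 Angle Between Two Lines

Consider two lines

\[ \frac{x - x_1}{k_1} = \frac{y - y_1}{l_1} = \frac{z - z_1}{m_1}, \tag{9.10} \]

\[ \frac{x - x_2}{k_2} = \frac{y - y_2}{l_2} = \frac{z - z_2}{m_2}. \tag{9.11} \]

The angle $\omega$ between two lines is equal to the angle formed by the directing
vectors $\mathbf{a}_1 = (k_1, l_1, m_1)$ and $\mathbf{a}_2 = (k_2, l_2, m_2)$:

\[ \cos\omega = \frac{(\mathbf{a}_1 \cdot \mathbf{a}_2)}{|\mathbf{a}_1||\mathbf{a}_2|} = \frac{k_1k_2 + l_1l_2 + m_1m_2}{\sqrt{k_1^2 + l_1^2 + m_1^2}\,\sqrt{k_2^2 + l_2^2 + m_2^2}}. \tag{9.12} \]

Example 9.4 Given are the lines:

\[ \frac{x - 4}{6} = \frac{y + 5}{3} = \frac{z + 1}{-1}, \tag{9.13} \]

\[ \frac{x - 2}{2} = \frac{y + 6}{1} = \frac{z + 2}{-1}. \tag{9.14} \]

Show that these lines lie in the same plane and find the angle between them.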
Solution The necessary and sufficient condition for the two lines to lie in the same plane is that
the scalar triple product $(\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_1T_2})$ is equal to zero, where $\mathbf{a}_1 = (6, 3, -1)$,
$\mathbf{a}_2 = (2, 1, -1)$ are the directing vectors of the lines, and the points $T_1$ and $T_2$ have
the coordinates $T_1(4, -5, -1)$, $T_2(2, -6, -2)$.

Compute $(\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_1T_2})$:

\[ (\mathbf{a}_1, \mathbf{a}_2, \overrightarrow{T_1T_2}) = \begin{vmatrix} 6 & 3 & -1 \\ 2 & 1 & -1 \\ -2 & -1 & -1 \end{vmatrix} = 0. \tag{9.15} \]

Therefore, the lines lie in the same plane. The angle $\omega$ between them is determined from
the condition

\[ \cos\omega = \frac{6 \cdot 2 + 3 \cdot 1 + (-1) \cdot (-1)}{\sqrt{6^2 + 3^2 + (-1)^2}\,\sqrt{2^2 + 1^2 + (-1)^2}} = \frac{8}{\sqrt{69}}. \tag{9.16} \]

As a result, we obtain the angle between the two lines

\[ \omega = \arccos\frac{8}{\sqrt{69}}. \tag{9.17} \]
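Both computations of Example 9.4 (the zero triple product and the cosine of the angle) can be reproduced with a few lines of Python. The helper names `dot`, `cross`, `lines_coplanar` and `cos_angle` are assumptions of this sketch; the coplanarity test uses exact comparison and is therefore meant for integer or rational input.

```python
import math


def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(p * q for p, q in zip(u, v))


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def lines_coplanar(t1, a1, t2, a2):
    """True when the triple product (a1, a2, T1T2) equals zero."""
    t1t2 = tuple(q - p for p, q in zip(t1, t2))
    return dot(a1, cross(a2, t1t2)) == 0


def cos_angle(a1, a2):
    """Cosine of the angle between two directing vectors, formula (9.12)."""
    return dot(a1, a2) / (math.sqrt(dot(a1, a1)) * math.sqrt(dot(a2, a2)))
```

With the data of the example, `lines_coplanar((4, -5, -1), (6, 3, -1), (2, -6, -2), (2, 1, -1))` is true, and `cos_angle((6, 3, -1), (2, 1, -1))` agrees with 8/√69 up to rounding.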


9.4 Angle Between a Line and a Plane 


Assume that the line $L$ and the plane $\pi$ intersect at some point, and assume that the
vectors are given: $\mathbf{n} = (A, B, C)$ is the normal vector of the plane $\pi$, and $\mathbf{a} = (k, l, m)$
is the directing vector of the line (see Fig. 9.1).

Fig. 9.1 The angle $\alpha$ between a line and a plane

If the line and the plane form the angle $\alpha$, and the vectors $\mathbf{n}$ and $\mathbf{a}$ form the angle
$\beta$, then $\alpha + \beta = \frac{\pi}{2}$. Then,

\[ \cos\beta = \frac{(\mathbf{n} \cdot \mathbf{a})}{|\mathbf{n}||\mathbf{a}|}, \qquad \cos\beta = \cos\left(\frac{\pi}{2} - \alpha\right) = \sin\alpha, \tag{9.18} \]

and the angle $\alpha$ between the line and the plane is determined from the condition:

\[ \sin\alpha = \frac{(\mathbf{n} \cdot \mathbf{a})}{|\mathbf{n}||\mathbf{a}|} = \frac{Ak + Bl + Cm}{\sqrt{A^2 + B^2 + C^2}\,\sqrt{k^2 + l^2 + m^2}}. \tag{9.19} \]

Condition of parallelism of the line $L$ and the plane $\pi$ (including the case where $L$ belongs to $\pi$):

\[ Ak + Bl + Cm = 0. \tag{9.20} \]

Condition of perpendicularity of the line $L$ and the plane $\pi$:

\[ \frac{A}{k} = \frac{B}{l} = \frac{C}{m}. \tag{9.21} \]

(Here, the equalities of the form $\frac{a}{b} = \frac{c}{d}$ are understood in terms of $ad = bc$.)

We obtain the condition of belonging of the line

\[ \frac{x - x_1}{k} = \frac{y - y_1}{l} = \frac{z - z_1}{m} \tag{9.22} \]

to the plane $Ax + By + Cz + D = 0$.

For this, it is necessary and sufficient that the point $T_1(x_1, y_1, z_1)$ should lie on
the plane, and the vectors $\mathbf{n} = (A, B, C)$ and $\mathbf{a} = (k, l, m)$ should be perpendicular
to each other. Therefore, the condition of the line's belonging to the plane consists
in the fulfilment of the equalities:

\[ \begin{cases} Ax_1 + By_1 + Cz_1 + D = 0, \\ Ak + Bl + Cm = 0. \end{cases} \tag{9.23} \]
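Conditions (9.19) to (9.23) can be collected into a small classifier. The following Python sketch rests on several assumptions: the function names and the returned labels are invented for illustration, the comparisons are exact (so integer or rational input is expected), and the absolute value of the scalar product is taken so that the angle falls in the range from 0 to π/2.

```python
import math


def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(p * q for p, q in zip(u, v))


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def line_plane_angle(n, a):
    """Angle between a line with direction a and a plane with normal n, see (9.19)."""
    s = abs(dot(n, a)) / (math.sqrt(dot(n, n)) * math.sqrt(dot(a, a)))
    return math.asin(min(1.0, s))


def line_plane_relation(plane, point, a):
    """Classify a line (point, direction a) against the plane (A, B, C, D)."""
    A, B, C, D = plane
    n = (A, B, C)
    if dot(n, a) == 0:                         # condition (9.20)
        on_plane = dot(n, point) + D == 0      # first equality of (9.23)
        return "lies in the plane" if on_plane else "parallel"
    if cross(n, a) == (0, 0, 0):               # condition (9.21): n and a collinear
        return "perpendicular"
    return "oblique"
```

For the line and plane of Example 9.1 the classifier reports "perpendicular", since the directing vector coincides with the normal (1, -4, 7).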
9.5 Condition of Two Lines' Belonging to a Plane


The two lines

\[ L_1: \; \frac{x - x_1}{k_1} = \frac{y - y_1}{l_1} = \frac{z - z_1}{m_1}, \]

passing through the point $T_1(x_1, y_1, z_1)$, and

\[ L_2: \; \frac{x - x_2}{k_2} = \frac{y - y_2}{l_2} = \frac{z - z_2}{m_2}, \]

passing through the point $T_2(x_2, y_2, z_2)$, in space can

1. intersect;
2. be parallel;
3. be skew.


In the first two cases, they lie in the same plane. Two lines that do not intersect
and are not parallel are called skew lines.

The necessary and sufficient condition of belonging of the lines $L_1$ and $L_2$
to the same plane consists in coplanarity of the vectors $\mathbf{a}_1 = (k_1, l_1, m_1)$,
$\mathbf{a}_2 = (k_2, l_2, m_2)$ and $\overrightarrow{T_1T_2} = (x_2 - x_1, y_2 - y_1, z_2 - z_1)$, i.e. the following determinant
must be equal to zero:

\[ \begin{vmatrix} x_2 - x_1 & y_2 - y_1 & z_2 - z_1 \\ k_1 & l_1 & m_1 \\ k_2 & l_2 & m_2 \end{vmatrix} = 0. \tag{9.24} \]

Condition of parallelism of two lines:

\[ \frac{k_1}{k_2} = \frac{l_1}{l_2} = \frac{m_1}{m_2}. \tag{9.25} \]

For intersection of lines, it is sufficient that at least one of the equalities (9.25)
is violated and the condition (9.24) is valid.


Example 9.5 Find the plane that passes through the line

\[ \frac{x - 4}{-8} = \frac{y - 8}{-1} = \frac{z + 1}{3} \tag{9.26} \]

and is orthogonal to the plane $-3x + y + z + 3 = 0$.

Solution The directing vector of the given line is $\mathbf{a} = (-8, -1, 3)$, and the normal
vector of the given plane is $\mathbf{n} = (-3, 1, 1)$.

Fig. 9.2 To Example 9.5. Mutual arrangement of the vectors $\mathbf{a}$ and $\overrightarrow{T_0T}$

The point $T_0(4, 8, -1)$ belongs to the sought plane since it lies on the line (see
Fig. 9.2).

Write the equation of the plane that passes through the point $T_0$ and is parallel to
the two vectors $\mathbf{a}$ and $\mathbf{n}$ (the vectors $\mathbf{a}$, $\mathbf{n}$ and $\overrightarrow{T_0T}$ are coplanar):

\[ \begin{vmatrix} x - 4 & y - 8 & z + 1 \\ -8 & -1 & 3 \\ -3 & 1 & 1 \end{vmatrix} = 0, \tag{9.27} \]

hence:

\[ -4(x - 4) - (y - 8) - 11(z + 1) = 0. \tag{9.28} \]

Thus, we obtain the sought equation of the plane:

\[ 4x + y + 11z - 13 = 0. \tag{9.29} \]


Example 9.6 Find the equation of the plane that passes through the two parallel
lines:

\[ \frac{x - U_1}{k} = \frac{y - V_1}{l} = \frac{z - W_1}{m}, \qquad \frac{x - U_2}{k} = \frac{y - V_2}{l} = \frac{z - W_2}{m}, \tag{9.30} \]

where at least one of the inequalities $U_1 \neq U_2$, $V_1 \neq V_2$, $W_1 \neq W_2$ is valid.

Solution The sought plane passes through the points $T_1(U_1, V_1, W_1)$ and
$T_2(U_2, V_2, W_2)$ and is parallel to the directing vector $\mathbf{a} = (k, l, m)$. On the other
hand, this plane is parallel to the vector $\overrightarrow{T_1T_2} = (U_2 - U_1, V_2 - V_1, W_2 - W_1)$.
Let $T(x, y, z)$ be the current point of the plane.
Write the condition of coplanarity of the vectors $\overrightarrow{T_1T}$, $\mathbf{a}$ and $\overrightarrow{T_1T_2}$:

\[ \begin{vmatrix} x - U_1 & y - V_1 & z - W_1 \\ k & l & m \\ U_2 - U_1 & V_2 - V_1 & W_2 - W_1 \end{vmatrix} = 0. \tag{9.31} \]

The relation (9.31) sets up the equation of the plane that passes through two
parallel lines.


Review Questions 


1. Write the equation of a straight line that passes through a specified point parallel
to a specified vector.

2. What does the equation of a line in space that passes through two specified points
look like?

3. Write the formula for computing an angle between lines in space.

4. How is an angle between a line and a plane computed?

5. Formulate the condition of parallelism of a line and a plane.

6. Write the condition of perpendicularity of a line and a plane.

7. What lines are called skew lines?

8. What is the condition of parallelism of two lines in space?


Problems 


9.1. Find the intersection points of the line specified by the equations

\[ \begin{cases} 2x + 4y + z + 9 = 0, \\ 4x - 6y - 2z + 1 = 0, \end{cases} \]

with the coordinate planes.
9.2. Find the equation of the plane that passes through the two parallel lines: 
x—2 
2 


eel #7 l 21 


-3’ 4 6 —6 


A 
3 


9.3. Find the coordinates of the foot of the perpendicular dropped from the point
$T(0, 2, -4)$ onto the plane $x + y - z + 3 = 0$.

9.4. Determine whether the plane $8x - 3z + 11 = 0$ and the line $\frac{x - 2}{\ldots} = \ldots$
are parallel.

9.5. Determine whether the plane $x + 2y - z - 2 = 0$ and the line $\ldots$
are parallel.

9.6. Write the necessary and sufficient condition of perpendicularity of the plane
$Ax + By + Cz + D = 0$ and the line $\frac{x - x_0}{k} = \frac{y - y_0}{l} = \frac{z - z_0}{m}$.

9.7. Prove that the condition of belonging of the line $\frac{x - x_0}{k} = \frac{y - y_0}{l} = \frac{z - z_0}{m}$
to the plane $Ax + By + Cz + D = 0$ has the form

\[ \begin{cases} Ax_0 + By_0 + Cz_0 + D = 0, \\ Ak + Bl + Cm = 0. \end{cases} \tag{9.32} \]

*9.8. Show that the distance from the point $T_0(x_0, y_0, z_0)$ to the line
$\frac{x - x_1}{k} = \frac{y - y_1}{l} = \frac{z - z_1}{m}$ can be calculated by the formula

\[ d = \sqrt{\frac{F_1^2 + F_2^2 + F_3^2}{k^2 + l^2 + m^2}}, \]

where the following designations are introduced:

\[ F_1 = \begin{vmatrix} l & m \\ y_1 - y_0 & z_1 - z_0 \end{vmatrix}, \quad F_2 = \begin{vmatrix} m & k \\ z_1 - z_0 & x_1 - x_0 \end{vmatrix}, \quad F_3 = \begin{vmatrix} k & l \\ x_1 - x_0 & y_1 - y_0 \end{vmatrix}. \]



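9.9. Find the acute angle between the lines specified by the equations
$\frac{x + 1}{2} = \frac{y - 9}{-5} = \frac{z + 2}{2}$ and $\frac{x}{3} = \frac{y + 1}{-6} = \frac{z + 4}{3}$.

9.10. Find the obtuse angle between the lines
$\frac{\ldots}{-1} = \frac{y - 9}{-5} = \frac{z + 2}{7}$ and $\frac{\ldots}{-3} = \frac{\ldots}{3} = \frac{\ldots}{5}$.
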
9.11. Find the equation of the plane that passes through the line

\[ \frac{x - 10}{-7} = \frac{y}{3} = \frac{z + 10}{-1} \tag{9.33} \]

parallel to the line

\[ \frac{x + 1}{2} = \frac{\ldots}{4} = \frac{z - 6}{8}. \]

*9.12. Compute the distance between the skew lines
$\frac{x - x_1}{k_1} = \frac{y - y_1}{l_1} = \frac{z - z_1}{m_1}$
and $\frac{x - x_2}{k_2} = \frac{y - y_2}{l_2} = \frac{z - z_2}{m_2}$.

Answers and Solutions

9.1 Answer:

\[ \left(-\frac{29}{14}, -\frac{17}{14}, 0\right), \quad \left(-\frac{19}{8}, 0, -\frac{17}{4}\right), \quad \left(0, -\frac{19}{2}, 29\right). \]

9.2 Answer:

\[ 9x - 10y - 4z - 14 = 0. \]


9.3 Solution. 
Write the equation of the line that passes through the point $T$ perpendicular to
the plane:

\[ \frac{x}{1} = \frac{y - 2}{1} = \frac{z + 4}{-1}. \tag{9.35} \]

Represent the equation of this line in parametric form:

\[ x = u, \quad y = u + 2, \quad z = -u - 4, \tag{9.36} \]

where $u \in \mathbb{R}$.
Substitute these coordinates into the equation of the plane:

\[ u + (u + 2) - (-u - 4) + 3 = 0, \tag{9.37} \]

hence, we obtain $u = -3$.
Knowing $u$, find the coordinates of the intersection point of the line and the plane:

\[ x = -3, \quad y = -1, \quad z = -1. \tag{9.38} \]

So, the foot of the perpendicular dropped from the point $T$ onto the
plane has the coordinates $(-3, -1, -1)$.
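The substitution used in Solution 9.3 works for any point and any plane, so it can be written as a short function. The sketch below is illustrative: the name `foot_of_perpendicular` and the tuple layout `(A, B, C, D)` are assumptions, and rational arithmetic is used so that the result is exact.

```python
from fractions import Fraction


def foot_of_perpendicular(point, plane):
    """Foot of the perpendicular dropped from `point` onto Ax + By + Cz + D = 0."""
    A, B, C, D = (Fraction(v) for v in plane)
    x0, y0, z0 = (Fraction(v) for v in point)
    # The perpendicular is x = x0 + A u, y = y0 + B u, z = z0 + C u.
    # Substituting it into the plane equation gives a linear equation for u.
    u = -(A * x0 + B * y0 + C * z0 + D) / (A * A + B * B + C * C)
    return (x0 + A * u, y0 + B * u, z0 + C * u)
```

For the data of Problem 9.3, `foot_of_perpendicular((0, 2, -4), (1, 1, -1, 3))` gives (-3, -1, -1), in agreement with (9.38).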




9.4 Solution. 
The condition of parallelism of the line $\frac{x - x_0}{k} = \frac{y - y_0}{l} = \frac{z - z_0}{m}$ and the
plane $Ax + By + Cz + D = 0$ reduces to the fulfilment of the equality $Ak + Bl + Cm = 0$,
which reflects orthogonality of the directing vector $\mathbf{t} = (k, l, m)$ and the normal to
the plane $\mathbf{n} = (A, B, C)$.

According to the problem statement, $\mathbf{t} = (2, 3, 8)$, $\mathbf{n} = (8, 0, -3)$.

Since the equality to zero of the scalar product $(\mathbf{t} \cdot \mathbf{n}) = 0$ is valid, the line
and the plane are parallel.


9.5 Answer: no, the plane $x + 2y - z - 2 = 0$ and the given line
are not parallel.


9.6 Solution. 
The normal vector of the plane $Ax + By + Cz + D = 0$ has the coordinates
$(A, B, C)$. At the same time, the directing vector of the line
$\frac{x - x_0}{k} = \frac{y - y_0}{l} = \frac{z - z_0}{m}$ is equal to $(k, l, m)$.
In order for the line to be perpendicular to the plane, it is necessary and sufficient
that the normal vector of the plane be collinear with the directing vector of the line:

\[ \frac{A}{k} = \frac{B}{l} = \frac{C}{m}. \]

9.7 Proof. 

Indeed, the first equation (9.32) means that the point (xo, yo, Zo), through which 
the line passes, belongs to the plane. The second equation reflects the fact of 
parallelism of the line and the plane (see Problem 9.4). 


9.8 Proof. 

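From the equation of the line, we find the coordinates of its directing vector
$\mathbf{s} = (k, l, m)$. Denote the point $(x_1, y_1, z_1)$ that lies on this line by $T_1(x_1, y_1, z_1)$.

It is known from the properties of the vector product that the modulus of the
vector product of two vectors is equal to the area of the parallelogram constructed on
these vectors:

\[ S = |\overrightarrow{T_0T_1} \times \mathbf{s}|. \]

On the other hand, the area of the parallelogram is equal to the product of its side
by the height drawn to this side: $S = |\mathbf{s}| d$.

In our case, the height of the parallelogram is equal to the distance $d$ from the
point to the line, and its side is equal to the modulus of the directing vector $|\mathbf{s}|$.
Having equated the expressions for the area, it is easy to obtain a formula for the
distance from the point to the line:

\[ d = \frac{|\overrightarrow{T_0T_1} \times \mathbf{s}|}{|\mathbf{s}|}. \]

Next, we find

\[ \overrightarrow{T_0T_1} \times \mathbf{s} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ x_1 - x_0 & y_1 - y_0 & z_1 - z_0 \\ k & l & m \end{vmatrix} = (m(y_1 - y_0) - l(z_1 - z_0))\mathbf{i} + (k(z_1 - z_0) - m(x_1 - x_0))\mathbf{j} + (l(x_1 - x_0) - k(y_1 - y_0))\mathbf{k}. \]

Thus,

\[ |\overrightarrow{T_0T_1} \times \mathbf{s}| = \sqrt{ \begin{vmatrix} l & m \\ y_1 - y_0 & z_1 - z_0 \end{vmatrix}^2 + \begin{vmatrix} m & k \\ z_1 - z_0 & x_1 - x_0 \end{vmatrix}^2 + \begin{vmatrix} k & l \\ x_1 - x_0 & y_1 - y_0 \end{vmatrix}^2 }, \]

where

\[ |\mathbf{s}| = \sqrt{k^2 + l^2 + m^2}. \]

Therefore,

\[ d = \frac{ \sqrt{ \begin{vmatrix} l & m \\ y_1 - y_0 & z_1 - z_0 \end{vmatrix}^2 + \begin{vmatrix} m & k \\ z_1 - z_0 & x_1 - x_0 \end{vmatrix}^2 + \begin{vmatrix} k & l \\ x_1 - x_0 & y_1 - y_0 \end{vmatrix}^2 } }{ \sqrt{k^2 + l^2 + m^2} }. \]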
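The distance formula of Problem 9.8 amounts to one vector product and two norms. A minimal Python sketch follows; the function names are hypothetical and floating-point arithmetic is assumed.

```python
import math


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def norm(u):
    """Modulus of a 3-vector."""
    return math.sqrt(sum(c * c for c in u))


def distance_point_line(t0, t1, s):
    """Distance from the point t0 to the line through t1 with directing vector s."""
    t0t1 = tuple(q - p for p, q in zip(t0, t1))
    return norm(cross(t0t1, s)) / norm(s)
```

As a quick check, the point (1, 2, 3) lies at distance 2 from the line through (1, 0, 0) with the directing vector (0, 0, 5), since that line is parallel to the axis Oz.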


9.9 Solution. 

The lines are specified in canonical form. 

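Find the angle $\alpha$ between their directing vectors $\mathbf{a} = (2, -5, 2)$ and
$\mathbf{b} = (3, -6, 3)$:

\[ \cos\alpha = \frac{(\mathbf{a} \cdot \mathbf{b})}{|\mathbf{a}| \cdot |\mathbf{b}|} = \frac{2 \cdot 3 + (-5) \cdot (-6) + 2 \cdot 3}{\sqrt{2^2 + (-5)^2 + 2^2} \cdot \sqrt{3^2 + (-6)^2 + 3^2}} = \frac{14}{3\sqrt{22}}. \]

The sought angle is $\alpha = \arccos\left(\frac{14}{3\sqrt{22}}\right)$.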
9.10 Solution. 
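Find the angle $\alpha$ between the directing vectors $\mathbf{a} = (-1, -5, 7)$ and
$\mathbf{b} = (-3, 3, 5)$ of the specified lines:

\[ \cos\alpha = \frac{\mathbf{a} \cdot \mathbf{b}}{|\mathbf{a}||\mathbf{b}|}. \]

We substitute here the numeric values from the problem statement and obtain

\[ \cos\alpha = \frac{(-1) \cdot (-3) + (-5) \cdot 3 + 7 \cdot 5}{\sqrt{(-1)^2 + (-5)^2 + 7^2}\,\sqrt{(-3)^2 + 3^2 + 5^2}} = \frac{23}{5\sqrt{129}}. \]

Therefore, the sought angle is $\pi - \alpha = \pi - \arccos\left(\frac{23}{5\sqrt{129}}\right)$.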
9.11 Solution. 
The sought plane is parallel to the directing vectors $\mathbf{a}_1 = (-7, 3, -1)$ and
$\mathbf{a}_2 = (2, 4, 8)$ and passes through the point $T_0(10, 0, -10)$ that lies on the first line. Thus,
the vectors $\mathbf{a}_1$, $\mathbf{a}_2$ and $\overrightarrow{T_0T}$ are coplanar.

Expanding the determinant that expresses the scalar triple product $(\overrightarrow{T_0T}, \mathbf{a}_1, \mathbf{a}_2)$,
for example, along the first row, we obtain

\[ \begin{vmatrix} x - 10 & y & z + 10 \\ -7 & 3 & -1 \\ 2 & 4 & 8 \end{vmatrix} = 28(x - 10) + 54y - 34(z + 10) = 0, \]

and we finally obtain

\[ 14x + 27y - 17z - 310 = 0. \]


9.12 Solution. 

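Introduce the designations $T_1(x_1, y_1, z_1)$, $T_2(x_2, y_2, z_2)$, $\mathbf{a}_1 = (k_1, l_1, m_1)$,
$\mathbf{a}_2 = (k_2, l_2, m_2)$.

Consider the parallelepiped constructed on the vectors $\overrightarrow{T_1T_2}$, $\mathbf{a}_1$ and $\mathbf{a}_2$, and
compute its volume $V$.

On the one hand, the volume $V$ is equal to the product of the modulus of the
vector product $|\mathbf{a}_1 \times \mathbf{a}_2|$ and the height of the parallelepiped. On the other hand,

\[ V = \operatorname{abs}(\overrightarrow{T_1T_2}, \mathbf{a}_1, \mathbf{a}_2). \]

Since the sought distance $d$ between the skew lines is equal to the height, then

\[ d = \frac{\operatorname{abs}(\overrightarrow{T_1T_2}, \mathbf{a}_1, \mathbf{a}_2)}{|\mathbf{a}_1 \times \mathbf{a}_2|} = \frac{ \operatorname{abs} \begin{vmatrix} x_2 - x_1 & y_2 - y_1 & z_2 - z_1 \\ k_1 & l_1 & m_1 \\ k_2 & l_2 & m_2 \end{vmatrix} }{ \sqrt{ \begin{vmatrix} k_1 & l_1 \\ k_2 & l_2 \end{vmatrix}^2 + \begin{vmatrix} l_1 & m_1 \\ l_2 & m_2 \end{vmatrix}^2 + \begin{vmatrix} m_1 & k_1 \\ m_2 & k_2 \end{vmatrix}^2 } }. \]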
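The formula of Solution 9.12 can be evaluated as a triple product divided by the modulus of a vector product. The Python sketch below uses hypothetical helper names and raises an error for parallel lines, where the vector product of the directing vectors vanishes and the formula does not apply.

```python
import math


def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(p * q for p, q in zip(u, v))


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def distance_skew_lines(t1, a1, t2, a2):
    """Distance between the lines (t1, a1) and (t2, a2) with non-collinear a1, a2."""
    n = cross(a1, a2)
    if dot(n, n) == 0:
        raise ValueError("the directing vectors are collinear")
    t1t2 = tuple(q - p for p, q in zip(t1, t2))
    # abs of the triple product (T1T2, a1, a2) divided by |a1 x a2|
    return abs(dot(t1t2, n)) / math.sqrt(dot(n, n))
```

For the axis Ox and the line through (0, 0, 1) parallel to the axis Oy, the function returns 1, the length of their common perpendicular.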


Chapter 10
Bilinear and Quadratic Forms


10.1 Bilinear Forms 


Assume that in an $n$-dimensional vector space $L_n$, a basis $B = (e_1, e_2, \ldots, e_n)$ is
specified. Consider two vectors belonging to the space $L_n$:

\[ x = \sum_{i=1}^{n} x_i e_i, \qquad y = \sum_{i=1}^{n} y_i e_i, \tag{10.1} \]

where $x_i, y_i \in \mathbb{R}$ for all $i = 1, 2, \ldots, n$.
The linear combination of all possible products of the projections of the vectors $x$
and $y$ on the normalized basis vectors

\[ \sum_{i,j=1}^{n} a_{ij} x_i y_j, \tag{10.2} \]

where $a_{ij}$ are arbitrary real numbers, is referred to as the bilinear form $A(x, y)$
defined on the basis $B$. The matrix $A = (a_{ij})$, where $1 \le i, j \le n$, is called the
matrix of the bilinear form. As is easy to see, an arbitrary bilinear form can be written
with the help of the matrix multiplication operation as

\[ A(x, y) = x^T A y. \tag{10.3} \]

The following properties of linearity of the form are fulfilled for each of its
arguments:
$\forall x, y, z \in L_n$ and $\forall \alpha \in \mathbb{R}$

1. $A(x + y, z) = A(x, z) + A(y, z)$,
2. $A(\alpha x, z) = \alpha A(x, z)$,
3. $A(x, y + z) = A(x, y) + A(x, z)$,
4. $A(x, \alpha z) = \alpha A(x, z)$.


A bilinear form is referred to as symmetric, if for any $x, y \in L_n$ the condition
$A(x, y) = A(y, x)$ is fulfilled.

A bilinear form is referred to as positively definite, if $\forall x \in L_n$, $x \neq 0$, the
inequality $A(x, x) > 0$ is fulfilled.


Example 10.1 Consider the vectors $x = (x_1, x_2, x_3)$ and $y = (y_1, y_2, y_3)$ of some
three-dimensional vector space, and the bilinear form $A(x, y) = x_1y_1 + 3x_2y_2 + 8x_3y_3$,
defined on the basis of this space. The matrix $A$ of the bilinear form will in this
case have the following diagonal form:

\[ A = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 8 \end{bmatrix}. \tag{10.4} \]

Represent $A$ in matrix notation:

\[ A(x, y) = x^T A y = [x_1, x_2, x_3] \begin{bmatrix} 1 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 8 \end{bmatrix} \begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix} = [x_1, 3x_2, 8x_3] \begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix} = x_1y_1 + 3x_2y_2 + 8x_3y_3. \tag{10.5} \]

It is easy to show that it has the properties of symmetry and positive definiteness:

\[ A(x, y) = x_1y_1 + 3x_2y_2 + 8x_3y_3 = y_1x_1 + 3y_2x_2 + 8y_3x_3 = A(y, x) \;\Rightarrow\; \text{the form is symmetric}; \tag{10.6} \]

\[ A(x, x) = x_1^2 + 3x_2^2 + 8x_3^2 > 0 \text{ for all non-zero vectors } x \;\Rightarrow\; A(x, y) \text{ is a positively definite form}. \tag{10.7} \]
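The matrix notation (10.3) is straightforward to evaluate directly. The following Python sketch uses the matrix of Example 10.1; the function names `bilinear` and `is_symmetric` are assumptions made for illustration.

```python
def bilinear(a, x, y):
    """Value of the double sum of a[i][j] * x[i] * y[j], i.e. x^T A y."""
    return sum(a[i][j] * x[i] * y[j]
               for i in range(len(x)) for j in range(len(y)))


def is_symmetric(a):
    """True when the matrix of the form satisfies a[i][j] == a[j][i]."""
    n = len(a)
    return all(a[i][j] == a[j][i] for i in range(n) for j in range(n))


# Matrix (10.4) of the form A(x, y) = x1*y1 + 3*x2*y2 + 8*x3*y3.
A = [[1, 0, 0],
     [0, 3, 0],
     [0, 0, 8]]
```

For x = (1, 2, 3) and y = (4, 5, 6) the value is 4 + 30 + 144 = 178, and swapping the arguments gives the same number because the matrix is symmetric.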


Example 10.2 $M(x, y) = x_1y_1 - x_2y_2 - x_3y_3 - x_4y_4$ is a bilinear form on $\mathbb{R}^4$. As
is easy to see, it is symmetric, but not positively definite. Note that this form defines
the space-time metric in the special theory of relativity [45].


10.2 Quadratic Forms 


A quadratic form is an expression

\[ \omega(x) = A(x, x), \tag{10.8} \]

where $A(x, x)$ is some bilinear form. The name "quadratic form" is associated with
the fact that this expression is homogeneous of the second degree
in its argument: $\forall \alpha \in \mathbb{R}$ the following equation is valid:

\[ \omega(\alpha x) = \alpha^2 \omega(x). \tag{10.9} \]


Example 10.3 On the basis of the bilinear form $M(x, y)$ defined in Example 10.2,
we can construct the quadratic form

\[ \omega(x) = M(x, x) = x_1^2 - x_2^2 - x_3^2 - x_4^2 \tag{10.10} \]

that depends on four variables: $x_1$, $x_2$, $x_3$ and $x_4$.


If in the $n$-dimensional vector space $L_n$ the basis is specified and the vector
$x = [x_1, x_2, \ldots, x_n]^T$ is selected, then

\[ \omega(x) = x^T A x = \sum_{i,j=1}^{n} a_{ij} x_i x_j, \tag{10.11} \]

which allows interpreting the quadratic form as a function specified on the set of all
possible vectors $x$.

The matrix $A = (a_{ij})$ is referred to as the matrix of the quadratic form $\omega(x)$. This
matrix can be deemed symmetric, since the expression of the form $a_{ij}x_ix_j + a_{ji}x_jx_i$,
due to commutativity of multiplication of real numbers, can always be represented
as

\[ a_{ij}x_ix_j + a_{ji}x_jx_i = \tilde{a}_{ij}x_ix_j + \tilde{a}_{ji}x_jx_i, \tag{10.12} \]

where the designation $\tilde{a}_{ij} = \tilde{a}_{ji} = (a_{ij} + a_{ji})/2$ is introduced.
So, an arbitrary quadratic form in some basis can be specified in matrix form:

\[ x^T A x, \tag{10.13} \]

where $x = (x_1, x_2, \ldots, x_n)^T$ is the column composed of variables, and the matrix
$A$ of the quadratic form can always be written in symmetric form: $a_{ij} = a_{ji}$ for all
$i, j = 1, \ldots, n$.
The symmetric bilinear form $A(x, y)$ is referred to as polar to the quadratic form
$\omega(x) = A(x, x)$. For the polar form, the following formula is valid:

\[ A(x, y) = \frac{1}{2}\left[\omega(x + y) - \omega(x) - \omega(y)\right]. \tag{10.14} \]


Example 10.4 Find the bilinear form polar to $\omega(x) = x_1x_2 + x_2x_3 + x_1x_3$.


Solution According to the definition of polar form (10.14), we obtain

\[ A(x, y) = \frac{1}{2}\left[(x_1 + y_1)(x_2 + y_2) + (x_2 + y_2)(x_3 + y_3) + (x_1 + y_1)(x_3 + y_3) - (x_1x_2 + x_2x_3 + x_1x_3) - (y_1y_2 + y_2y_3 + y_1y_3)\right]. \tag{10.15} \]

After algebraic transformations, we find the expression for the sought bilinear form:

\[ A(x, y) = \frac{1}{2}(x_1y_2 + x_1y_3 + x_2y_1 + x_2y_3 + x_3y_1 + x_3y_2). \tag{10.16} \]
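Formula (10.14) can be compared with the closed form (10.16) on concrete vectors. In the sketch below, the names `omega`, `polar` and `closed_form` are hypothetical, and integer inputs are assumed so that `Fraction` yields exact halves.

```python
from fractions import Fraction


def omega(x):
    """Quadratic form of Example 10.4."""
    x1, x2, x3 = x
    return x1 * x2 + x2 * x3 + x1 * x3


def polar(q, x, y):
    """Polar bilinear form of q by formula (10.14)."""
    s = tuple(p + r for p, r in zip(x, y))
    return Fraction(q(s) - q(x) - q(y), 2)


def closed_form(x, y):
    """Right-hand side of (10.16)."""
    x1, x2, x3 = x
    y1, y2, y3 = y
    return Fraction(x1 * y2 + x1 * y3 + x2 * y1 + x2 * y3 + x3 * y1 + x3 * y2, 2)
```

For x = (1, 2, 3) and y = (4, 5, 6) both expressions give 29, and `polar(omega, x, x)` returns `omega(x)`, as the definition of the polar form requires.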


When passing to a new basis, i.e. under a nondegenerate change of the variables
$x_1, x_2, \ldots, x_n$ with the change matrix $C$, the matrix $A'$ of the quadratic form in the new
basis will take the form:

\[ A' = C^T A C. \tag{10.17} \]

It is known that changing the basis does not change the rank of
the matrix of the quadratic form.

The rank of the matrix of a quadratic form is referred to as the rank of the quadratic
form. If this matrix has a rank equal to the dimension of the vector space, i.e. the
number of variables $n$, then the quadratic form is called nondegenerate, and if the
rank is less than $n$, then it is called degenerate.


Example 10.5 Find the matrix of the quadratic form of three variables:

\[ \omega(x) = 3x_1^2 + 2x_1x_2 - 5x_1x_3 - x_2^2 + x_3^2. \tag{10.18} \]

Solution The diagonal elements $a_{ii}$ of the matrix of the quadratic form $\omega(x)$ are
defined as the coefficients of the quadratic summands $x_i^2$, and the non-diagonal ones
$a_{ij}$, where $i \neq j$, are half of the respective coefficients of the summands
of the form $x_ix_j$:

\[ A = \begin{bmatrix} 3 & 1 & -5/2 \\ 1 & -1 & 0 \\ -5/2 & 0 & 1 \end{bmatrix}. \tag{10.19} \]

Using any of the known methods for computing the rank of a matrix, for
example, the method of bringing to echelon form, we find $\operatorname{rk} A = 3$. Since the
rank of the matrix $A$ is equal to the number of variables, this quadratic form is
nondegenerate.

In matrix notation, the quadratic form can be represented in the form:

\[ \omega(x) = [x_1, x_2, x_3] \begin{bmatrix} 3 & 1 & -5/2 \\ 1 & -1 & 0 \\ -5/2 & 0 & 1 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}. \tag{10.20} \]
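The rule of Example 10.5 (diagonal entries are the coefficients of the squares, off-diagonal entries are half the coefficients of the mixed products) and the rank computation can be sketched as follows. The dictionary layout of the coefficients and the names `form_matrix` and `rank` are assumptions of this illustration.

```python
from fractions import Fraction


def form_matrix(n, coeffs):
    """Symmetric matrix of a quadratic form.

    coeffs maps a pair (i, j) with i <= j (numbered from 1) to the
    coefficient of x_i * x_j in the form.
    """
    a = [[Fraction(0)] * n for _ in range(n)]
    for (i, j), c in coeffs.items():
        if i == j:
            a[i - 1][i - 1] += c
        else:
            a[i - 1][j - 1] += Fraction(c, 2)
            a[j - 1][i - 1] += Fraction(c, 2)
    return a


def rank(m):
    """Rank by reduction to echelon form with exact arithmetic."""
    rows = [row[:] for row in m]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r
```

Passing the coefficients of (10.18) as `{(1, 1): 3, (1, 2): 2, (1, 3): -5, (2, 2): -1, (3, 3): 1}` reproduces the matrix (10.19), whose rank is 3.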


10.3 Bringing the Quadratic Form to the Canonical Form 


If, in some basis, the matrix of the quadratic form takes diagonal form

\[ A = \begin{bmatrix} \lambda_1 & 0 & 0 & \ldots & 0 \\ 0 & \lambda_2 & 0 & \ldots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \ldots & \lambda_n \end{bmatrix}, \tag{10.21} \]

then, as is easy to see, the quadratic form is formed by a linear combination of
squares of the variables $x_1, x_2, \ldots, x_n$:

\[ \omega(x) = \lambda_1 x_1^2 + \lambda_2 x_2^2 + \cdots + \lambda_n x_n^2. \tag{10.22} \]

The cross terms of the form $x_ix_j$ for $i \neq j$ are in this case not included in
the expression for $\omega(x)$.

Among the coefficients $\lambda_i$, where $i = 1, 2, \ldots, n$, there can be positive numbers,
negative numbers and numbers equal to zero.

Theorem 10.1 For each quadratic form, there exists a basis in which it has the
canonical form:

\[ \omega(x) = \sum_{i=1}^{n} \lambda_i x_i^2. \tag{10.23} \]

Attention should be paid to the fact that the canonical basis is not uniquely
defined.
10.3.1 Lagrange's Method of Separating Perfect Squares


In applications, we often come across the problem of bringing a quadratic form to
diagonal form. Several methods have been suggested for solving this problem.
Lagrange's method of bringing the quadratic form to the canonical form
consists in the following.

Assume that the quadratic form $\omega(x) = \sum_{i,j=1}^{n} a_{ij}x_ix_j$ is given, defined in the
basis of the space $L_n$.
Depending on the presence in the sum of the summands of the form $a_{ii}x_i^2$,
consider two cases.

1. At least one of the coefficients $a_{ii}$ of the quadratic summands is not equal to zero.

The main idea of the method consists in separating the perfect square, uniting all
the summands that contain the variable $x_i$. The obtained perfect square is used as the
basis for the change of variables, excluding the terms linear in $x_i$. If cross terms of
the form $x_ix_j$ still remain in the form, then we return to the beginning of the
procedure. Otherwise, the form contains only quadratic summands, and the solution
is complete.

As we can see, without loss of generality, we may assume that $a_{11} \neq 0$.

Consider the first step of Lagrange's method in more detail.

Denote by $S$ the sum of all summands containing $x_1$:

\[ S = a_{11}x_1^2 + 2a_{12}x_1x_2 + \cdots + 2a_{1n}x_1x_n, \tag{10.24} \]

and complete the square of $S$. We obtain

\[ S = a_{11}\left(x_1 + \frac{a_{12}}{a_{11}}x_2 + \cdots + \frac{a_{1n}}{a_{11}}x_n\right)^2 - R, \tag{10.25} \]

where the expression $R$ does not contain $x_1$ in its notation.
Then, change the variables

\[ x_1' = x_1 + \frac{a_{12}}{a_{11}}x_2 + \cdots + \frac{a_{1n}}{a_{11}}x_n, \qquad x_i' = x_i \text{ for } i = 2, \ldots, n. \tag{10.26} \]

Then, the quadratic form in the new basis will take the form:

\[ \omega(x') = a_{11}(x_1')^2 + \sum_{i,j=2}^{n} a_{ij}' x_i' x_j'. \tag{10.27} \]

From here on, the same method can be applied to the variables $x_2$, $x_3$ and so on,
to finally exclude all summands of the form $x_ix_j$ for $i \neq j$.

2. All coefficients are $a_{ii} = 0$.

In this case, select $a_{ij} \neq 0$ for some $i \neq j$ and change the variables:

\[ x_i = x_i' + x_j', \qquad x_j = x_i' - x_j'. \tag{10.28} \]

As a result, the product $x_ix_j$ will be presented in the form of a linear
combination of quadratic summands, $x_ix_j = (x_i')^2 - (x_j')^2$, and we will arrive at
the first case.

After step (1) and, when necessary, step (2), bringing of the quadratic form to the
diagonal form will be completed.


Example 10.6 Using Lagrange's method, bring the following form to the canonical
one

\[ \omega(x) = 2x_1x_2 - 6x_1x_3 - x_2^2 + 5x_3^2. \tag{10.29} \]

Solution Collect the summands that contain $x_2$ and complete the square:

\[ -x_2^2 + 2x_1x_2 - 6x_1x_3 + 5x_3^2 = -(x_2^2 - 2x_1x_2 + x_1^2) + x_1^2 - 6x_1x_3 + 5x_3^2 = -(x_1 - x_2)^2 + x_1^2 - 6x_1x_3 + 5x_3^2. \tag{10.30} \]

Change the variables:

\[ \begin{cases} y_1 = x_1, \\ y_2 = x_1 - x_2, \\ y_3 = x_3. \end{cases} \tag{10.31} \]

As a result, we obtain the form

\[ \omega(y) = -y_2^2 + y_1^2 - 6y_1y_3 + 5y_3^2. \tag{10.32} \]

Now collect the summands that contain $y_1$, and complete the square of them:

\[ \omega(y) = -y_2^2 + (y_1^2 - 6y_1y_3 + 9y_3^2) - 9y_3^2 + 5y_3^2 = -y_2^2 + (y_1 - 3y_3)^2 - 4y_3^2. \tag{10.33} \]

Perform the second change of the variables:

\[ \begin{cases} z_1 = y_1 - 3y_3, \\ z_2 = y_2, \\ z_3 = y_3. \end{cases} \tag{10.34} \]

We finally obtain $\omega(z) = z_1^2 - z_2^2 - 4z_3^2$.
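The two changes of variables in Example 10.6 can be spot-checked numerically: the original form and the canonical form must take equal values at corresponding points. The sketch below is only a check on a small integer grid, and the function names are hypothetical.

```python
from itertools import product


def omega(x1, x2, x3):
    """The form (10.29) in the original variables."""
    return 2 * x1 * x2 - 6 * x1 * x3 - x2 ** 2 + 5 * x3 ** 2


def to_canonical(x1, x2, x3):
    """Apply the change (10.31) followed by the change (10.34)."""
    y1, y2, y3 = x1, x1 - x2, x3
    return (y1 - 3 * y3, y2, y3)


def omega_canonical(z1, z2, z3):
    """The canonical form obtained at the end of the example."""
    return z1 ** 2 - z2 ** 2 - 4 * z3 ** 2


def substitution_is_consistent(limit=3):
    """Compare both expressions at all integer points with |x_i| <= limit."""
    grid = range(-limit, limit + 1)
    return all(omega(*x) == omega_canonical(*to_canonical(*x))
               for x in product(grid, repeat=3))
```

For instance, the point (1, 2, 3) is mapped to z = (-8, -1, 3), and both forms evaluate to 27 there.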


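Example 10.7 Using Lagrange's method, bring the following form to the canonical
one

\[ \omega(x) = x_1x_2 + x_1x_3 + x_2x_3. \tag{10.35} \]

Solution Since there are no summands of the form $x_i^2$ in this expression, we change
the variables

\[ \begin{cases} x_1 = x_1' + x_2', \\ x_2 = x_1' - x_2', \\ x_3 = x_3', \end{cases} \tag{10.36} \]

as a result of which we obtain

\[ \omega(x') = \left[(x_1')^2 + 2x_1'x_3' + (x_3')^2\right] - (x_2')^2 - (x_3')^2 \tag{10.37} \]

\[ = (x_1' + x_3')^2 - (x_2')^2 - (x_3')^2. \tag{10.38} \]

The new change

\[ \begin{cases} y_1 = x_1' + x_3', \\ y_2 = x_2', \\ y_3 = x_3' \end{cases} \tag{10.39} \]

results in the form

\[ \omega(y) = y_1^2 - y_2^2 - y_3^2, \tag{10.40} \]

where $y_1 = \frac{1}{2}(x_1 + x_2) + x_3$, $y_2 = \frac{1}{2}(x_1 - x_2)$, $y_3 = x_3$.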
10.3.2 Jacobi Method


Let the following determinants composed of the elements of the matrix $A = (a_{ij})$
of the quadratic form $\omega(x)$ be other than zero:

\[ \Delta_1 = a_{11}, \quad \Delta_2 = \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}, \quad \ldots, \tag{10.41} \]

\[ \Delta_n = \begin{vmatrix} a_{11} & a_{12} & \ldots & a_{1n} \\ a_{21} & a_{22} & \ldots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \ldots & a_{nn} \end{vmatrix}. \tag{10.42} \]

Then, there exists a basis $B = (c_1, c_2, \ldots, c_n)$, in which $\omega(x)$ is presented in the
form:

\[ \omega(z) = \frac{\Delta_1}{\Delta_0} z_1^2 + \frac{\Delta_2}{\Delta_1} z_2^2 + \cdots + \frac{\Delta_n}{\Delta_{n-1}} z_n^2, \tag{10.43} \]

where $z_i$, $i = 1, 2, \ldots, n$, denote the coordinates of the vector $x$ in the new basis
$B$, and for uniformity, we assume that $\Delta_0 = 1$. This is the essence of the Jacobi
method of bringing the quadratic form to the canonical form.

In comparison with Lagrange's method, the Jacobi method has the advantage that
the transition to the basis $B$ is direct, without any intermediate steps.

The transition from the basis $(e_1, e_2, \ldots, e_n)$ to the canonical basis
$(c_1, c_2, \ldots, c_n)$ is performed by the formulae

\[ c_i = \sum_{j=1}^{i} \eta_{ij} e_j, \quad i = 1, 2, \ldots, n, \tag{10.44} \]

\[ \eta_{ij} = (-1)^{i+j} \frac{\Delta_{i-1,j}}{\Delta_{i-1}}, \tag{10.45} \]

where $\Delta_{i-1,j}$ is the minor of the matrix formed by the elements from $(a_{ij})$ that
are situated at the intersection of the rows numbered $k = 1, 2, \ldots, i - 1$ with the
columns numbered $k = 1, 2, \ldots, j - 1, j + 1, \ldots, i$.
Note that the introduced determinants $\Delta_1, \Delta_2, \ldots, \Delta_n$ are called the corner
minors of the matrix of the quadratic form.
The rank of a quadratic form is the number of non-zero coefficients $\lambda_i \neq 0$ in its canonical form.
Let us introduce the following designations:


• $n_+$: the number of positive coefficients, $\lambda_i > 0$,
• $n_-$: the number of negative coefficients, $\lambda_i < 0$,
• $n_0$: the number of coefficients equal to zero, $\lambda_i = 0$.


The ordered set of non-negative integers $(n_+, n_-, n_0)$ is called the signature of the quadratic form.

The quadratic form $\omega(x)$ is referred to as positively definite if for any non-zero $x$ the inequality
\[ \omega(x) > 0 \tag{10.46} \]
is fulfilled, and negatively definite if $\omega(x) < 0$ is fulfilled for any non-zero $x$.
The quadratic form $\omega(x)$ is referred to as alternating if there exist $x_1$ and $x_2$ such that
\[ \omega(x_1) > 0, \quad \omega(x_2) < 0. \tag{10.47} \]


Sylvester's$^1$ Criterion For the positive definiteness of the quadratic form $\omega(x)$, it is necessary and sufficient that all the corner minors of its matrix should be positive: $\Delta_1 > 0$, $\Delta_2 > 0$, ..., $\Delta_n > 0$.

For the negative definiteness of the quadratic form $\omega(x)$, it is necessary and sufficient that the signs of the corner minors of its matrix should alternate, and $\Delta_1 < 0$.
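Sylvester's criterion translates directly into a sign test on the corner minors. The sketch below is a hedged illustration: the function names `det` and `classify` are hypothetical, the determinant is computed by Laplace expansion (adequate only for small matrices), and the label "not definite" deliberately does not distinguish alternating forms from semidefinite ones, since the criterion as stated covers only the two definite cases.

```python
from fractions import Fraction


def det(m):
    """Determinant by Laplace expansion along the first row (small n only)."""
    if len(m) == 1:
        return Fraction(m[0][0])
    total = Fraction(0)
    for j, value in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * Fraction(value) * det(minor)
    return total


def classify(a):
    """Apply Sylvester's criterion to a symmetric matrix given as a list of lists."""
    minors = [det([row[:k] for row in a[:k]]) for k in range(1, len(a) + 1)]
    if all(d > 0 for d in minors):
        return "positively definite"
    # Signs must alternate starting with Delta_1 < 0.
    if all((d < 0) if k % 2 == 1 else (d > 0)
           for k, d in enumerate(minors, start=1)):
        return "negatively definite"
    return "not definite"


half = Fraction(1, 2)
print(classify([[1, half], [half, 1]]))    # matrix of x1^2 + x1*x2 + x2^2
print(classify([[-2, 1], [1, -2]]))        # corner minors -2 and 3
```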


The Law of Inertia The number of summands with positive (negative) coefficients 
of a quadratic form brought to the canonical form does not depend on the method 
used to obtain such a representation. 


Review Questions 


1. Define bilinear form. 

2. Enumerate the property of linearity of bilinear forms. 

3. What bilinear forms are called symmetric and positively defined? 

4. Define quadratic form. 

5. Explain why a matrix of an arbitrary quadratic form can always be deemed to 
be symmetric. 

6. What bilinear form is called polar to the quadratic form? 

7. How is the rank of a quadratic form determined? 

8. What is the difference between degenerate and nondegenerate quadratic forms? 


$^1$ James Joseph Sylvester (1814-1897), English mathematician.

9. Explain the methods of bringing quadratic forms to diagonal form: Lagrange's method and the Jacobi method.
10. How are the corner minors of a matrix of quadratic form determined? 
11. Formulate Sylvester’s criterion. 
12. What is the law of inertia of quadratic forms? 


Problems 


10.1. Which of the following functions $F(x, y)$ of the vectors $x = (x_1, x_2) \in \mathbb{R}^2$ and $y = (y_1, y_2) \in \mathbb{R}^2$ are bilinear forms?


(1) $F(x, y) = x_1y_1 + x_2y_2$;

(2) $F(x, y) = \sqrt{2}$;

(3) $F(x, y) = (x_1 - y_1)^2 - (x_2 - y_2)^2$;

(4) $F(x, y) = 4x_2y_2$.


10.2. Find the bilinear form that is polar to the form:


(1) $x_1^2 - 2x_1x_2 + 3x_1x_3 + 7x_3^2$;
(2) $-2x_1^2 + 3x_1x_3 + x_2x_3$;
(3) $3x_1x_3 + x_2^2$;
(4) $x_1^2 + 4x_1x_2 + 4x_1x_3 - 4x_2^2 - 2x_2x_3 - x_3^2$.

10.3. Bring to the canonical form the quadratic form of three variables:

(1) $x_1^2 + 2x_1x_2 + 4x_1x_3 - 4x_3^2$;
(2) $x_1x_2 + x_1x_3 - 6x_2x_3$;
(3) $-2x_1^2 - 7x_1x_3$;
(4) $x_1^2 - 4x_1x_2 + 12x_1x_3 - x_2^2 + 4x_2x_3 + 3x_3^2$.

10.4. Bring to the canonical form the quadratic form of four variables:

(1) $x_1^2 + 2x_1x_2 + 2x_1x_3 + 2x_1x_4 + 3x_2^2 + 6x_2x_3 + 8x_2x_4 + x_3^2 + 2x_3x_4 + x_4^2$;
(2) $x_1x_2 + x_1x_4 + x_2x_3 + x_3x_4$;
(3) $-5x_1^2 + 2x_1x_4 + 3x_2^2 + x_2x_3 - 2x_2x_4 - x_3^2 + 2x_3x_4 + x_4^2$;
(4) $x_1^2 + x_1x_2 + x_1x_3 + x_1x_4 + x_2^2 + x_2x_3 + x_2x_4 + x_3^2 + x_3x_4 + x_4^2$.


10.5. Which of the following quadratic forms are positively definite, negatively 
definite and alternating? 


(1) $x_1^2 + x_1x_2 + x_2^2$;

(2) $x_1^2 - 9x_1x_2 + x_2^2$;

(3) $x_1x_2 + 2x_1x_4 + 3x_2x_3 + 4x_3x_4$;
(4) $x_1x_2 + x_3x_4$.

*10.6. At what values of the real parameter $a$ is the quadratic form
\[ \omega(x) = ax_1^2 + 2x_1x_2 + (10 - a)x_2^2 \tag{10.48} \]
positively definite and negatively definite?


Z XiXj sh . 
*10.7. Show that the form }) > -———— is positively definite. 
petted 


Answers and Solutions 


10.1 Answer: 
The functions from items (1) and (4) are bilinear forms. 


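10.2 Solution.
Use the formula (10.14) for the polar form:


(1)
\[
\begin{aligned}
A(x, y) &= \tfrac{1}{2}\bigl[\omega(x + y) - \omega(x) - \omega(y)\bigr] \\
&= \tfrac{1}{2}\bigl[(x_1 + y_1)^2 - 2(x_1 + y_1)(x_2 + y_2) + 3(x_1 + y_1)(x_3 + y_3) + 7(x_3 + y_3)^2 \\
&\qquad - (x_1^2 - 2x_1x_2 + 3x_1x_3 + 7x_3^2) - (y_1^2 - 2y_1y_2 + 3y_1y_3 + 7y_3^2)\bigr] \\
&= x_1y_1 - x_1y_2 + \tfrac{3}{2}x_1y_3 - x_2y_1 + \tfrac{3}{2}x_3y_1 + 7x_3y_3;
\end{aligned}
\]
(2)
\[
\begin{aligned}
A(x, y) &= \tfrac{1}{2}\bigl[-2(x_1 + y_1)^2 + 3(x_1 + y_1)(x_3 + y_3) + (x_2 + y_2)(x_3 + y_3) \\
&\qquad - (-2x_1^2 + 3x_1x_3 + x_2x_3) - (-2y_1^2 + 3y_1y_3 + y_2y_3)\bigr] \\
&= -2x_1y_1 + \tfrac{3}{2}x_1y_3 + \tfrac{1}{2}x_2y_3 + \tfrac{3}{2}x_3y_1 + \tfrac{1}{2}x_3y_2;
\end{aligned}
\]
(3)
\[
\begin{aligned}
A(x, y) &= \tfrac{1}{2}\bigl[3(x_1 + y_1)(x_3 + y_3) + (x_2 + y_2)^2 - (3x_1x_3 + x_2^2) - (3y_1y_3 + y_2^2)\bigr] \\
&= \tfrac{3}{2}x_1y_3 + x_2y_2 + \tfrac{3}{2}x_3y_1;
\end{aligned}
\]
(4)
\[
\begin{aligned}
A(x, y) &= \tfrac{1}{2}\bigl[(x_1 + y_1)^2 + 4(x_1 + y_1)(x_2 + y_2) + 4(x_1 + y_1)(x_3 + y_3) \\
&\qquad - 4(x_2 + y_2)^2 - 2(x_2 + y_2)(x_3 + y_3) - (x_3 + y_3)^2 \\
&\qquad - (x_1^2 + 4x_1x_2 + 4x_1x_3 - 4x_2^2 - 2x_2x_3 - x_3^2) \\
&\qquad - (y_1^2 + 4y_1y_2 + 4y_1y_3 - 4y_2^2 - 2y_2y_3 - y_3^2)\bigr] \\
&= x_1y_1 + 2x_1y_2 + 2x_1y_3 + 2x_2y_1 - x_2y_3 - 4x_2y_2 + 2x_3y_1 - x_3y_2 - x_3y_3.
\end{aligned}
\]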
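The polarization formula used in this solution is easy to check numerically. The following Python sketch is an illustration only: the function names `omega`, `polar` and `polar_closed_form` are hypothetical, and the quadratic form is assumed to be $x_1^2 - 2x_1x_2 + 3x_1x_3 + 7x_3^2$ from item (1). It compares $\tfrac{1}{2}[\omega(x+y) - \omega(x) - \omega(y)]$ with the closed-form answer.

```python
from fractions import Fraction


def omega(x):
    """Quadratic form of item (1): x1^2 - 2*x1*x2 + 3*x1*x3 + 7*x3^2."""
    x1, x2, x3 = x
    return x1 * x1 - 2 * x1 * x2 + 3 * x1 * x3 + 7 * x3 * x3


def polar(form, x, y):
    """Polar bilinear form: (form(x + y) - form(x) - form(y)) / 2."""
    s = [xi + yi for xi, yi in zip(x, y)]
    return Fraction(form(s) - form(x) - form(y), 2)


def polar_closed_form(x, y):
    """Closed-form answer obtained for item (1)."""
    x1, x2, x3 = x
    y1, y2, y3 = y
    return (x1 * y1 - x1 * y2 + Fraction(3, 2) * x1 * y3
            - x2 * y1 + Fraction(3, 2) * x3 * y1 + 7 * x3 * y3)


x, y = (1, 2, 3), (4, -1, 2)
print(polar(omega, x, y) == polar_closed_form(x, y))
print(polar(omega, x, x) == omega(x))  # the polar form restricted to y = x gives omega
```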


10.3 Solution.


(1) Transform the expression:
\[
\begin{aligned}
x_1^2 + 2x_1x_2 + 4x_1x_3 - 4x_3^2
&= x_1^2 + 2x_1(x_2 + 2x_3) + (x_2 + 2x_3)^2 - (x_2 + 2x_3)^2 - 4x_3^2 \\
&= (x_1 + x_2 + 2x_3)^2 - (x_2 + 2x_3)^2 - (2x_3)^2.
\end{aligned}
\]
Change the variables:
\[ y_1 = x_1 + x_2 + 2x_3, \quad y_2 = x_2 + 2x_3, \quad y_3 = 2x_3. \]
We obtain the quadratic form in the canonical form:
\[ y_1^2 - y_2^2 - y_3^2. \]
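(2) Change the variables:
\[ x_1 = x_1' + x_2', \quad x_2 = x_1' - x_2', \quad x_3 = x_3'. \]
Transform the expression:
\[
\begin{aligned}
x_1x_2 + x_1x_3 - 6x_2x_3
&= (x_1' + x_2')(x_1' - x_2') + (x_1' + x_2')x_3' - 6(x_1' - x_2')x_3' \\
&= (x_1')^2 - (x_2')^2 + x_1'x_3' + x_2'x_3' - 6x_1'x_3' + 6x_2'x_3' \\
&= (x_1')^2 - (x_2')^2 - 5x_1'x_3' + 7x_2'x_3' \\
&= (x_1')^2 - 5x_1'x_3' + \bigl(\tfrac{5}{2}x_3'\bigr)^2 - \bigl(\tfrac{5}{2}x_3'\bigr)^2 + 7x_2'x_3' - (x_2')^2 \\
&= \bigl(x_1' - \tfrac{5}{2}x_3'\bigr)^2 - \bigl[(x_2')^2 - 7x_2'x_3' + \bigl(\tfrac{7}{2}x_3'\bigr)^2\bigr] + \bigl(\tfrac{7}{2}x_3'\bigr)^2 - \bigl(\tfrac{5}{2}x_3'\bigr)^2 \\
&= \bigl(x_1' - \tfrac{5}{2}x_3'\bigr)^2 - \bigl(x_2' - \tfrac{7}{2}x_3'\bigr)^2 + 6(x_3')^2.
\end{aligned}
\]
Change the variables:
\[
\begin{aligned}
y_1 &= x_1' - \tfrac{5}{2}x_3' = \tfrac{1}{2}(x_1 + x_2 - 5x_3), \\
y_2 &= x_2' - \tfrac{7}{2}x_3' = \tfrac{1}{2}(x_1 - x_2 - 7x_3), \\
y_3 &= \sqrt{6}\,x_3' = \sqrt{6}\,x_3.
\end{aligned}
\]
We obtain the quadratic form in the canonical form:
\[ y_1^2 - y_2^2 + y_3^2. \]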
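(3) Transform the expression:
\[
\begin{aligned}
-2x_1^2 - 7x_1x_3
&= -(\sqrt{2}\,x_1)^2 - 7x_1x_3 - \Bigl(\tfrac{7\sqrt{2}}{4}x_3\Bigr)^2 + \Bigl(\tfrac{7\sqrt{2}}{4}x_3\Bigr)^2 \\
&= -\Bigl(\sqrt{2}\,x_1 + \tfrac{7\sqrt{2}}{4}x_3\Bigr)^2 + \Bigl(\tfrac{7\sqrt{2}}{4}x_3\Bigr)^2.
\end{aligned}
\]
Change the variables:
\[ y_1 = \sqrt{2}\,x_1 + \tfrac{7\sqrt{2}}{4}x_3, \quad y_2 = \tfrac{7\sqrt{2}}{4}x_3. \]
We obtain the quadratic form in the canonical form:
\[ -y_1^2 + y_2^2. \]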
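(4) Transform the expression:
\[
\begin{aligned}
&x_1^2 - 4x_1x_2 + 12x_1x_3 - x_2^2 + 4x_2x_3 + 3x_3^2 \\
&= x_1^2 - 4x_1(x_2 - 3x_3) + (2x_2 - 6x_3)^2 - (2x_2 - 6x_3)^2 - x_2^2 + 4x_2x_3 + 3x_3^2 \\
&= (x_1 - 2x_2 + 6x_3)^2 - 4x_2^2 + 24x_2x_3 - 36x_3^2 - x_2^2 + 4x_2x_3 + 3x_3^2 \\
&= (x_1 - 2x_2 + 6x_3)^2 - 5x_2^2 + 28x_2x_3 - 33x_3^2 \\
&= (x_1 - 2x_2 + 6x_3)^2 - (\sqrt{5}\,x_2)^2 + 28x_2x_3 - \Bigl(\tfrac{14\sqrt{5}}{5}x_3\Bigr)^2 + \Bigl(\tfrac{14\sqrt{5}}{5}x_3\Bigr)^2 - 33x_3^2 \\
&= (x_1 - 2x_2 + 6x_3)^2 - \Bigl(\sqrt{5}\,x_2 - \tfrac{14\sqrt{5}}{5}x_3\Bigr)^2 + \tfrac{31}{5}x_3^2.
\end{aligned}
\]
Change the variables:
\[ y_1 = x_1 - 2x_2 + 6x_3, \quad y_2 = \sqrt{5}\Bigl(x_2 - \tfrac{14}{5}x_3\Bigr), \quad y_3 = \sqrt{\tfrac{31}{5}}\,x_3. \]
We obtain the quadratic form in the canonical form:
\[ y_1^2 - y_2^2 + y_3^2. \]
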
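10.4 Solution.


(1) Transform the expression:
\[
\begin{aligned}
&x_1^2 + 2x_1x_2 + 2x_1x_3 + 2x_1x_4 + 3x_2^2 + 6x_2x_3 + 8x_2x_4 + x_3^2 + 2x_3x_4 + x_4^2 \\
&= x_1^2 + 2x_1(x_2 + x_3 + x_4) + (x_2 + x_3 + x_4)^2 - (x_2 + x_3 + x_4)^2 \\
&\quad + 3x_2^2 + 6x_2x_3 + 8x_2x_4 + x_3^2 + 2x_3x_4 + x_4^2 \\
&= (x_1 + x_2 + x_3 + x_4)^2 - x_2^2 - x_3^2 - x_4^2 - 2x_2x_3 - 2x_2x_4 - 2x_3x_4 \\
&\quad + 3x_2^2 + 6x_2x_3 + 8x_2x_4 + x_3^2 + 2x_3x_4 + x_4^2 \\
&= (x_1 + x_2 + x_3 + x_4)^2 + 2x_2^2 + 4x_2x_3 + 6x_2x_4 \\
&= (x_1 + x_2 + x_3 + x_4)^2 + (\sqrt{2}\,x_2)^2 + 2x_2(2x_3 + 3x_4) + \tfrac{1}{2}(2x_3 + 3x_4)^2 - \tfrac{1}{2}(2x_3 + 3x_4)^2 \\
&= (x_1 + x_2 + x_3 + x_4)^2 + \Bigl(\sqrt{2}\,x_2 + \sqrt{2}\,x_3 + \tfrac{3\sqrt{2}}{2}x_4\Bigr)^2 - \Bigl(\sqrt{2}\,x_3 + \tfrac{3\sqrt{2}}{2}x_4\Bigr)^2.
\end{aligned}
\]
Change the variables:
\[
\begin{aligned}
y_1 &= x_1 + x_2 + x_3 + x_4, \\
y_2 &= \sqrt{2}\,x_2 + \sqrt{2}\,x_3 + \tfrac{3\sqrt{2}}{2}x_4, \\
y_3 &= \sqrt{2}\,x_3 + \tfrac{3\sqrt{2}}{2}x_4.
\end{aligned}
\]
We obtain the quadratic form in the canonical form:
\[ y_1^2 + y_2^2 - y_3^2. \]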
(2) Change the variables:
\[ x_1 = x_1' + x_2', \quad x_2 = x_1' - x_2', \quad x_3 = x_3' + x_4', \quad x_4 = x_3' - x_4'. \]
Transform the expression:
\[
\begin{aligned}
&x_1x_2 + x_1x_4 + x_2x_3 + x_3x_4 \\
&= (x_1' + x_2')(x_1' - x_2') + (x_1' + x_2')(x_3' - x_4') + (x_1' - x_2')(x_3' + x_4') + (x_3' + x_4')(x_3' - x_4') \\
&= (x_1')^2 - (x_2')^2 + 2x_1'x_3' - 2x_2'x_4' + (x_3')^2 - (x_4')^2 \\
&= (x_1')^2 + 2x_1'x_3' + (x_3')^2 - (x_2')^2 - 2x_2'x_4' - (x_4')^2 \\
&= (x_1' + x_3')^2 - (x_2' + x_4')^2.
\end{aligned}
\]
Change the variables:
\[
\begin{aligned}
y_1 &= x_1' + x_3' = \tfrac{1}{2}(x_1 + x_2 + x_3 + x_4), \\
y_2 &= x_2' + x_4' = \tfrac{1}{2}(x_1 - x_2 + x_3 - x_4).
\end{aligned}
\]
We obtain the quadratic form in the canonical form:
\[ y_1^2 - y_2^2. \]
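A reduction of this kind can be verified by substituting the new variables back and comparing both sides on a grid of test points. The sketch below is an illustrative check only; the function names `original_form` and `canonical_form` are hypothetical, and the form is assumed to be $x_1x_2 + x_1x_4 + x_2x_3 + x_3x_4$ with $y_1 = \tfrac{1}{2}(x_1 + x_2 + x_3 + x_4)$ and $y_2 = \tfrac{1}{2}(x_1 - x_2 + x_3 - x_4)$.

```python
from fractions import Fraction
from itertools import product


def original_form(x1, x2, x3, x4):
    """The quadratic form x1*x2 + x1*x4 + x2*x3 + x3*x4."""
    return x1 * x2 + x1 * x4 + x2 * x3 + x3 * x4


def canonical_form(x1, x2, x3, x4):
    """The same form written as y1^2 - y2^2 in the new variables."""
    y1 = Fraction(x1 + x2 + x3 + x4, 2)
    y2 = Fraction(x1 - x2 + x3 - x4, 2)
    return y1 * y1 - y2 * y2


# Compare the two expressions on all integer points of a small grid.
all_equal = all(
    original_form(*p) == canonical_form(*p)
    for p in product(range(-2, 3), repeat=4)
)
print(all_equal)
```

Exact rational arithmetic is used so that the comparison is not affected by rounding.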
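(3) Transform the expression:
\[
\begin{aligned}
&-5x_1^2 + 2x_1x_4 + 3x_2^2 + x_2x_3 - 2x_2x_4 - x_3^2 + 2x_3x_4 + x_4^2 \\
&= \bigl[x_4^2 + 2x_4(x_1 - x_2 + x_3) + (x_1 - x_2 + x_3)^2\bigr] - (x_1 - x_2 + x_3)^2 - 5x_1^2 + 3x_2^2 + x_2x_3 - x_3^2 \\
&= (x_1 - x_2 + x_3 + x_4)^2 - 6x_1^2 + 2x_1x_2 + 2x_2^2 - 2x_1x_3 + 3x_2x_3 - 2x_3^2 \\
&= (x_1 - x_2 + x_3 + x_4)^2 - (\sqrt{6}\,x_1)^2 + 2x_1(x_2 - x_3) - \Bigl(\tfrac{\sqrt{6}}{6}(x_2 - x_3)\Bigr)^2 \\
&\quad + \Bigl(\tfrac{\sqrt{6}}{6}(x_2 - x_3)\Bigr)^2 + 2x_2^2 + 3x_2x_3 - 2x_3^2 \\
&= (x_1 - x_2 + x_3 + x_4)^2 - \Bigl(\sqrt{6}\,x_1 - \tfrac{\sqrt{6}}{6}(x_2 - x_3)\Bigr)^2 + \tfrac{13}{6}x_2^2 + \tfrac{8}{3}x_2x_3 - \tfrac{11}{6}x_3^2 \\
&= (x_1 - x_2 + x_3 + x_4)^2 - \Bigl(\sqrt{6}\,x_1 - \tfrac{\sqrt{6}}{6}(x_2 - x_3)\Bigr)^2 \\
&\quad + \Bigl[\tfrac{13}{6}x_2^2 + \tfrac{8}{3}x_2x_3 + \Bigl(\tfrac{8\sqrt{78}}{78}x_3\Bigr)^2\Bigr] - \Bigl(\tfrac{8\sqrt{78}}{78}x_3\Bigr)^2 - \tfrac{11}{6}x_3^2 \\
&= (x_1 - x_2 + x_3 + x_4)^2 - \Bigl(\sqrt{6}\,x_1 - \tfrac{\sqrt{6}}{6}(x_2 - x_3)\Bigr)^2 + \Bigl(\sqrt{\tfrac{13}{6}}\,x_2 + \tfrac{8\sqrt{78}}{78}x_3\Bigr)^2 - \tfrac{69}{26}x_3^2.
\end{aligned}
\]
Change the variables:
\[
\begin{aligned}
y_1 &= x_1 - x_2 + x_3 + x_4, \\
y_2 &= \sqrt{6}\,x_1 - \tfrac{\sqrt{6}}{6}(x_2 - x_3), \\
y_3 &= \sqrt{\tfrac{13}{6}}\,x_2 + \tfrac{8\sqrt{78}}{78}x_3, \\
y_4 &= \sqrt{\tfrac{69}{26}}\,x_3.
\end{aligned}
\]
We obtain the quadratic form in the canonical form:
\[ y_1^2 - y_2^2 + y_3^2 - y_4^2. \]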
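(4) Transform the expression:
\[
\begin{aligned}
&x_1^2 + x_1x_2 + x_1x_3 + x_1x_4 + x_2^2 + x_2x_3 + x_2x_4 + x_3^2 + x_3x_4 + x_4^2 \\
&= x_1^2 + x_1(x_2 + x_3 + x_4) + \tfrac{1}{4}(x_2 + x_3 + x_4)^2 - \tfrac{1}{4}(x_2 + x_3 + x_4)^2 \\
&\quad + x_2^2 + x_2x_3 + x_2x_4 + x_3^2 + x_3x_4 + x_4^2 \\
&= \Bigl(x_1 + \tfrac{1}{2}(x_2 + x_3 + x_4)\Bigr)^2 + \tfrac{3}{4}x_2^2 + \tfrac{1}{2}x_2x_3 + \tfrac{1}{2}x_2x_4 + \tfrac{3}{4}x_3^2 + \tfrac{1}{2}x_3x_4 + \tfrac{3}{4}x_4^2 \\
&= \Bigl(x_1 + \tfrac{1}{2}(x_2 + x_3 + x_4)\Bigr)^2 + \Bigl(\tfrac{\sqrt{3}}{2}x_2\Bigr)^2 + \tfrac{1}{2}x_2(x_3 + x_4) + \Bigl(\tfrac{\sqrt{3}}{6}(x_3 + x_4)\Bigr)^2 \\
&\quad - \Bigl(\tfrac{\sqrt{3}}{6}(x_3 + x_4)\Bigr)^2 + \tfrac{3}{4}x_3^2 + \tfrac{1}{2}x_3x_4 + \tfrac{3}{4}x_4^2 \\
&= \Bigl(x_1 + \tfrac{1}{2}(x_2 + x_3 + x_4)\Bigr)^2 + \Bigl(\tfrac{\sqrt{3}}{2}x_2 + \tfrac{\sqrt{3}}{6}(x_3 + x_4)\Bigr)^2 + \tfrac{2}{3}x_3^2 + \tfrac{1}{3}x_3x_4 + \tfrac{2}{3}x_4^2 \\
&= \Bigl(x_1 + \tfrac{1}{2}(x_2 + x_3 + x_4)\Bigr)^2 + \Bigl(\tfrac{\sqrt{3}}{2}x_2 + \tfrac{\sqrt{3}}{6}(x_3 + x_4)\Bigr)^2 \\
&\quad + \Bigl(\sqrt{\tfrac{2}{3}}\,x_3 + \tfrac{\sqrt{6}}{12}x_4\Bigr)^2 + \Bigl(\tfrac{\sqrt{10}}{4}x_4\Bigr)^2.
\end{aligned}
\]
Change the variables:
\[
\begin{aligned}
y_1 &= x_1 + \tfrac{1}{2}(x_2 + x_3 + x_4), \\
y_2 &= \tfrac{\sqrt{3}}{2}x_2 + \tfrac{\sqrt{3}}{6}(x_3 + x_4), \\
y_3 &= \sqrt{\tfrac{2}{3}}\,x_3 + \tfrac{\sqrt{6}}{12}x_4, \\
y_4 &= \tfrac{\sqrt{10}}{4}x_4.
\end{aligned}
\]
We obtain the quadratic form in the canonical form:
\[ y_1^2 + y_2^2 + y_3^2 + y_4^2. \]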


10.5 Solution.


(1) Write the matrix of the quadratic form:
\[ \begin{pmatrix} 1 & \tfrac{1}{2} \\ \tfrac{1}{2} & 1 \end{pmatrix}. \]
Use Sylvester's criterion. For this, compute the corner minors:
\[ \Delta_1 = |1| = 1 > 0, \quad \Delta_2 = \begin{vmatrix} 1 & \tfrac{1}{2} \\ \tfrac{1}{2} & 1 \end{vmatrix} = \tfrac{3}{4} > 0. \]
Therefore, the form is positively definite.

(2) Write the matrix of the quadratic form:
\[ \begin{pmatrix} 1 & -\tfrac{9}{2} \\ -\tfrac{9}{2} & 1 \end{pmatrix}. \]
Use Sylvester's criterion. For this, compute the corner minors:
\[ \Delta_1 = |1| = 1 > 0, \quad \Delta_2 = \begin{vmatrix} 1 & -\tfrac{9}{2} \\ -\tfrac{9}{2} & 1 \end{vmatrix} = 1 - \Bigl(\tfrac{9}{2}\Bigr)^2 < 0. \]
Therefore, the form is alternating.

(3) Assume that $x_3 = x_4 = 0$. In this case, only one non-zero summand $x_1x_2$ remains in the quadratic form. It is obvious that this summand can take both positive and negative values. Therefore, the form is alternating.

(4) Assume that $x_3 = 0$. In this case, only one non-zero summand $x_1x_2$ remains in the quadratic form. It is obvious that this summand can be both positive and negative. Therefore, the form is alternating.


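10.6 Solution.
Write the matrix of the quadratic form:
\[ \begin{pmatrix} a & 1 \\ 1 & 10 - a \end{pmatrix}. \]
Use Sylvester's criterion. For this, compute the corner minors:
\[ \Delta_1 = |a| = a; \quad \Delta_2 = \begin{vmatrix} a & 1 \\ 1 & 10 - a \end{vmatrix} = a(10 - a) - 1. \]
In order for the form to be positively definite, it is necessary and sufficient that both determinants should be positive:
\[
\begin{cases} a > 0, \\ a(10 - a) - 1 > 0 \end{cases}
\Rightarrow
\begin{cases} a > 0, \\ a^2 - 10a + 1 < 0 \end{cases}
\Rightarrow
\begin{cases} a > 0, \\ a \in (5 - 2\sqrt{6},\, 5 + 2\sqrt{6}). \end{cases}
\]
In order for the form to be negatively definite, it is necessary and sufficient that the corner minor of the first order should be negative, and that of the second order should be positive:
\[ \begin{cases} a < 0, \\ a \in (5 - 2\sqrt{6},\, 5 + 2\sqrt{6}). \end{cases} \]
This system is inconsistent; therefore, the form cannot be negatively definite at any values of $a$.
Consider the cases when the corner minors are equal to zero:

$a = 0$: $\omega(x) = 2x_1x_2 + 10x_2^2$; $\omega(x)$ can take a negative value, for example, at $x_1 = -10$, $x_2 = 1$;

$a = 5 - 2\sqrt{6}$:
\[
\begin{aligned}
\omega(x) &= (5 - 2\sqrt{6})x_1^2 + 2x_1x_2 + (5 + 2\sqrt{6})x_2^2 \\
&= \Bigl(\sqrt{5 - 2\sqrt{6}}\,x_1\Bigr)^2 + 2x_1x_2 + \Bigl(\sqrt{5 + 2\sqrt{6}}\,x_2\Bigr)^2 \\
&= \Bigl(\sqrt{5 - 2\sqrt{6}}\,x_1 + \sqrt{5 + 2\sqrt{6}}\,x_2\Bigr)^2.
\end{aligned}
\]
Here $\omega(x) \geqslant 0$ for all $x$, but $\omega(x) = 0$ at the non-zero vector $x_1 = \sqrt{5 + 2\sqrt{6}}$, $x_2 = -\sqrt{5 - 2\sqrt{6}}$, so the strict inequality $\omega(x) > 0$ fails and the form is not positively definite.
At $a = 5 + 2\sqrt{6}$, the reasoning is similar.
We obtain the final answer: the form is positively definite if and only if $a \in (5 - 2\sqrt{6},\, 5 + 2\sqrt{6})$.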
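The two corner-minor conditions for the form (10.48) can also be scanned numerically over a range of parameter values. The sketch below is only a sanity check under stated assumptions: the function name `is_positively_definite` and the grid of sample values are hypothetical choices, and floating-point arithmetic is used, so values of $a$ very close to $5 \pm 2\sqrt{6}$ are avoided.

```python
import math


def is_positively_definite(a):
    """Sylvester's test for a*x1^2 + 2*x1*x2 + (10 - a)*x2^2."""
    delta_1 = a
    delta_2 = a * (10 - a) - 1
    return delta_1 > 0 and delta_2 > 0


low = 5 - 2 * math.sqrt(6)    # approximately 0.101
high = 5 + 2 * math.sqrt(6)   # approximately 9.899

# Sample the parameter on a grid with step 0.01 and compare with the interval.
samples = [k / 100 for k in range(-200, 1200)]
agrees = all(is_positively_definite(a) == (low < a < high) for a in samples)
print(agrees)
```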
10.7 Solution.
Transform the expression, having introduced integration by the auxiliary variable $t$:


i=1 


1 1 
Bratt B feast f Soe 


n a2 
As is easy to see, the expression ( Ye xt! 7) is always greater than zero at the 
i=1 
non-zero values of x1, x2,...,X, andt > 0. 
The initial quadratic form is positively definite, since it is the integral of the 


positively definite form. 


Chapter 11
Curves of the Second Order


The general equation of a curve of the second order has the form
\[ Ax^2 + 2Bxy + Cy^2 + 2Dx + 2Ey + F = 0, \tag{11.1} \]
where the real coefficients $A$, $B$, $C$ are not equal to zero simultaneously.

The expression $Ax^2 + 2Bxy + Cy^2$ is referred to as the quadratic term of the equation of this curve, $2Dx + 2Ey$ is referred to as the linear term, and $F$ is referred to as the constant term.

In the so-called canonical system of coordinates, the equation of curve of the 
second order takes the simplest canonical form [8]. 

Let us consider the classification of curves of the second order based on their 
canonical form. 


11.1 Ellipse 


Ellipse is a curve of the second order that in some Cartesian rectangular system of coordinates is defined by the equation
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1, \tag{11.2} \]
where $a \geqslant b > 0$. The numbers $a$ and $b$ are referred to as the major and minor semiaxes of the ellipse, respectively.

The points $(\pm a, 0)$ and $(0, \pm b)$ are called the vertices of the ellipse, and $(\pm c, 0)$, where $c = \sqrt{a^2 - b^2}$, are its foci. We will denote the foci by $F_1$ and $F_2$. Figure 11.1 schematically shows an ellipse in the canonical system of coordinates.

In case of equality of the constants $a = b$, the ellipse degenerates into a circle.

Fig. 11.1 Ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$. The directrices $x = \pm a/\varepsilon$ are shown by the dotted lines


From the geometrical point of view, the ellipse is a set of points of a plane, for each of which the sum of distances to two specified points $F_1$ and $F_2$ (foci) is constant and equal to $2a > F_1F_2$. The focal distance, i.e. the distance between $F_1$ and $F_2$, is equal to $2c$.

The non-negative number $\varepsilon = c/a < 1$ defines the degree of "compression" of the ellipse towards the abscissa axis and is referred to as eccentricity. The greater the value of $\varepsilon$, the more pronounced is the "compression" (see Fig. 11.2). As is easy to see, the eccentricity of the circle is equal to $\varepsilon = \sqrt{a^2 - b^2}/a = 0$. In this connection, we can say that the ellipse is obtained from the circle through its compression towards the $Ox$-axis, when the ordinates of all its points decrease in the same proportion $b/a$.


Theorem 11.1 The distance from an arbitrary point $P(x, y)$, which belongs to the ellipse, to each focus, is equal to
\[ r_1 = PF_1 = a - \varepsilon x, \quad r_2 = PF_2 = a + \varepsilon x. \tag{11.3} \]


The ellipse has two directrices, the straight lines of the form $x = \pm a/\varepsilon$. They are shown in Fig. 11.1 as dotted lines. Directrices are not defined for the circle. The directrix and the focus that lie on the same side of the axis $Oy$ will be considered to be corresponding to each other.

Fig. 11.2 Ellipses with different eccentricities: $\varepsilon = 0$ (solid line), $\varepsilon = 0.7$ (dotted line), $\varepsilon = 0.9$ (dash-and-dot line)


Theorem 11.2 For any point that belongs to the ellipse, the ratio of its distance to 
the focus to its distance to the corresponding directrix is equal to the eccentricity of 
the ellipse. 


Let us consider the mutual arrangement of the ellipse and the straight line. An arbitrary straight line either does not intersect the ellipse or intersects it at one or two points. If there is only one common point, such a straight line is a tangent. Exactly one tangent passes through any point on the ellipse.

The equation of the tangent to the ellipse at the point $P_0(x_0, y_0)$ has the form:
\[ \frac{x x_0}{a^2} + \frac{y y_0}{b^2} = 1. \tag{11.4} \]

The ellipse has the following optical property: the light beams originating from one focus, after mirror reflection from the ellipse, pass through the other focus, i.e. focus on it. This explains the origin of the term "focus" borrowed from optics.


Example 11.1 Let us find the intersection points of the coordinate axes and the tangent drawn to the ellipse $\frac{x^2}{9} + \frac{y^2}{4} = 1$ at the point with the coordinates $(3/2, \sqrt{3})$.

Solution Denote the tangency point by $P_0$ and verify that it belongs to the ellipse:
\[ \frac{(3/2)^2}{9} + \frac{(\sqrt{3})^2}{4} = 1, \quad \text{or } 1 = 1, \text{ which is true.} \tag{11.5} \]
Taking into account that the semiaxes of the ellipse are equal to $a = 3$ and $b = 2$, substitute the coordinates $P_0(x_0, y_0)$ into the equation of the tangent (11.4):
\[ \frac{(3/2)\,x}{9} + \frac{\sqrt{3}\,y}{4} = 1. \tag{11.6} \]
Therefore, the general equation of the straight line has the form $2x + 3\sqrt{3}\,y - 12 = 0$ (see Fig. 11.3). The intersection points of this line with the coordinate axes are $(6, 0)$ and $(0, 4/\sqrt{3})$.
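The computation in Example 11.1 can be reproduced with a few lines of code. The sketch below is illustrative: the function names `tangent_line` and `axis_intercepts` are hypothetical, and the tangent is represented by the coefficients of $Ax + By + C = 0$ taken directly from (11.4).

```python
import math


def tangent_line(a, b, x0, y0):
    """Coefficients (A, B, C) of A*x + B*y + C = 0 for the tangent (11.4)."""
    return x0 / a**2, y0 / b**2, -1.0


def axis_intercepts(A, B, C):
    """Intersections of the line with the Ox and Oy axes (None if parallel)."""
    x_intercept = (-C / A, 0.0) if A != 0 else None
    y_intercept = (0.0, -C / B) if B != 0 else None
    return x_intercept, y_intercept


# Data of Example 11.1: a = 3, b = 2, tangency point (3/2, sqrt(3)).
A, B, C = tangent_line(3.0, 2.0, 1.5, math.sqrt(3))
on_ox, on_oy = axis_intercepts(A, B, C)
print(on_ox, on_oy)  # close to (6, 0) and (0, 4/sqrt(3))
```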


Fig. 11.3 To Example 11.1. The ellipse $\frac{x^2}{9} + \frac{y^2}{4} = 1$ and the tangent to it at the point $P_0(3/2, \sqrt{3})$

The idea of the ellipse as the compressed circle results in an alternative method of specifying the ellipse in parametric form.
Transform the coordinates in accordance with the formulae
\[ x' = x, \quad y' = \frac{b}{a}\,y. \tag{11.7} \]
After such a transformation, the circle $x^2 + y^2 = a^2$, in the new system of coordinates, turns into the ellipse $\frac{(x')^2}{a^2} + \frac{(y')^2}{b^2} = 1$. As is known, for example, from the course of mathematical analysis [76], any circle can be specified in parametric form as
\[ \begin{cases} x = a\cos t, \\ y = a\sin t, \end{cases} \tag{11.8} \]
where the real number $t$ takes the values that belong to the interval $[0, 2\pi)$.

From (11.7) it follows that the parametric representation of the ellipse with the semiaxes $a$ and $b$ will have the form:
\[ \begin{cases} x = a\cos t, \\ y = b\sin t, \end{cases} \quad \text{where } 0 \leqslant t < 2\pi. \tag{11.9} \]

The parameter $t$ is referred to as the eccentric anomaly.
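The parametric form (11.9) gives a convenient way to generate points of the ellipse and to check its focal properties numerically. The sketch below is a hedged illustration: the function names `ellipse_point` and `focal_distances` are hypothetical, the sample semiaxes $a = 3$, $b = 2$ are arbitrary, and the two distances are labelled by the position of the focus (right or left) rather than by $F_1$, $F_2$, which is an assumption about the numbering.

```python
import math


def ellipse_point(a, b, t):
    """Point of the ellipse for parameter t, following (11.9)."""
    return a * math.cos(t), b * math.sin(t)


def focal_distances(a, b, x, y):
    """Distances from (x, y) to the right focus (c, 0) and the left focus (-c, 0)."""
    c = math.sqrt(a * a - b * b)
    return math.hypot(x - c, y), math.hypot(x + c, y)


a, b = 3.0, 2.0
eps = math.sqrt(a * a - b * b) / a          # eccentricity c / a
for i in range(8):
    t = 2 * math.pi * i / 8
    x, y = ellipse_point(a, b, t)
    to_right, to_left = focal_distances(a, b, x, y)
    assert math.isclose(x * x / a**2 + y * y / b**2, 1.0)   # canonical equation
    assert math.isclose(to_right + to_left, 2 * a)          # sum of focal distances
    assert math.isclose(to_right, a - eps * x)               # distance to (c, 0)
    assert math.isclose(to_left, a + eps * x)                # distance to (-c, 0)
```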


11.2 Hyperbola 


Hyperbola is a curve of the second order, which in some Cartesian rectangular system of coordinates is defined by the equation
\[ \frac{x^2}{a^2} - \frac{y^2}{b^2} = 1, \tag{11.10} \]
where $a, b > 0$ (see Fig. 11.4). The numbers $a$ and $b$ are called the real and imaginary semiaxes of the hyperbola, respectively.

Unlike the ellipse, which is a connected curve, the hyperbola consists of two connected components: left and right branches (sheets of the hyperbola).

The vertices of the hyperbola are the points $(\pm a, 0)$. The foci of the hyperbola are the points $F_1(-c, 0)$ and $F_2(c, 0)$, where $c = \sqrt{a^2 + b^2}$.

A hyperbola has asymptotes $y = \pm\frac{b}{a}x$ that define the run of the curve for infinitely great values of coordinates.

A hyperbola contains only those points of the Cartesian plane for which the modulus of the difference of distances to the two given points $F_1$ and $F_2$ (foci) is constant and equal to $2a < F_1F_2$. The focal distance is equal to $2c$.

For the hyperbola, a concept of eccentricity $\varepsilon = c/a > 1$ is also introduced.


Theorem 11.3 The distances $r_1$, $r_2$ from an arbitrary point $P(x, y)$ of the hyperbola to each focus are equal to
\[ r_1 = PF_1 = |a - \varepsilon x|, \quad r_2 = PF_2 = |a + \varepsilon x|. \tag{11.11} \]


The directrices of the hyperbola are specified in the canonical system of coordinates by the equations $x = -\frac{a}{\varepsilon}$ and $x = \frac{a}{\varepsilon}$ (see Fig. 11.4, where the directrices are shown by dotted lines). The directrix and the focus that lie on the same side of the axis $Oy$ will be considered to be corresponding to each other.


Theorem 11.4 For an arbitrary point that lies on the hyperbola, the ratio of its distance to the focus to the distance to the corresponding directrix is equal to the eccentricity of the hyperbola.


The equation of the tangent to the hyperbola at the point $P_0(x_0, y_0)$ has the form:
\[ \frac{x x_0}{a^2} - \frac{y y_0}{b^2} = 1. \tag{11.12} \]


Optical property of hyperbola: the light from the source situated at one of the hyperbola's foci is reflected by the second branch of the hyperbola so that the continuations of the reflected beams intersect at the second focus.


Fig. 11.4 Hyperbola $\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1$. The asymptotes $y = \pm\frac{b}{a}x$ are shown by thin solid lines, the directrices $x = \pm\frac{a}{\varepsilon}$ by dotted lines
Both for the ellipse and the hyperbola, in the canonical system of coordinates, the origin of coordinates $O(0, 0)$ is the centre of symmetry of the curve; this is why the ellipse and the hyperbola belong to the class of central curves.

In conclusion of the section devoted to the hyperbola, we will provide its representation in parametric form:
\[ \begin{cases} x = \pm a\cosh t, \\ y = b\sinh t, \end{cases} \tag{11.13} \]
where $t \in \mathbb{R}$, $\cosh x = (e^x + e^{-x})/2$ is the hyperbolic cosine, and $\sinh x = (e^x - e^{-x})/2$ is the hyperbolic sine. In the first equation of this system, the sign "+" corresponds to the right branch of the hyperbola, and the sign "−" corresponds to the left branch.


11.3 Parabola


Parabola is the noncentral curve of the second order, defined by the canonical equation in Cartesian rectangular system of coordinates
\[ y^2 = 2px, \tag{11.14} \]
where $p > 0$ is the focal parameter of the parabola, or simply the parameter.
The vertex of the parabola is the origin of coordinates $(0, 0)$, the focus is the point $F(p/2, 0)$. The directrix of the parabola is the straight line specified by the equation $x = -p/2$ (see Fig. 11.5).
From the geometrical point of view, the parabola is the set of the points of the plane, for each of which the distance to the focus $F$ is equal to the distance to the directrix. The distance from the focus to the directrix is equal to the parameter.


Fig. 11.5 Parabola $y^2 = 2px$. The directrix $x = -p/2$ is shown by a dotted line
Theorem 11.5 The distance from the point $P(x, y)$ lying on the parabola to its focus is equal to
\[ r = x + p/2. \tag{11.15} \]


The parabola is assigned the eccentricity equal to one.
For the parabola, the analogue of Theorems 11.2 and 11.4 repeats its geometrical property, which allows formulating the following generalized statement.


Theorem 11.6 For an arbitrary point that lies on the ellipse, hyperbola or 
parabola, the ratio of the distance from this point to the focus to the distance 
to the corresponding directrix (to the only directrix in case of parabola) is equal to 
the eccentricity of the curve. 
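The focus-directrix property stated in this theorem can be checked numerically for all three curves. The Python sketch below is only an illustration under stated assumptions: the function names are hypothetical, the points are generated from the parametric forms (for the hyperbola, the right branch only), and in each case the right focus is paired with the directrix on the same side of the $Oy$ axis.

```python
import math


def focus_directrix_ratio(point, focus, directrix_x):
    """Distance to the focus divided by distance to the vertical directrix."""
    px, py = point
    fx, fy = focus
    return math.hypot(px - fx, py - fy) / abs(px - directrix_x)


def ellipse_ratio(a, b, t):
    """Ratio and eccentricity for a point of the ellipse (requires a > b)."""
    c = math.sqrt(a * a - b * b)
    eps = c / a
    point = (a * math.cos(t), b * math.sin(t))
    return focus_directrix_ratio(point, (c, 0.0), a / eps), eps


def hyperbola_ratio(a, b, t):
    """Ratio and eccentricity for a point of the right branch."""
    c = math.sqrt(a * a + b * b)
    eps = c / a
    point = (a * math.cosh(t), b * math.sinh(t))
    return focus_directrix_ratio(point, (c, 0.0), a / eps), eps


def parabola_ratio(p, y):
    """Ratio and eccentricity (equal to one) for a point of y^2 = 2*p*x."""
    point = (y * y / (2 * p), y)
    return focus_directrix_ratio(point, (p / 2, 0.0), -p / 2), 1.0


for ratio, eps in (ellipse_ratio(3.0, 2.0, 0.7),
                   hyperbola_ratio(3.0, 2.0, 0.7),
                   parabola_ratio(1.5, 2.0)):
    print(math.isclose(ratio, eps))
```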


The equation of the tangent to the parabola at the point $P_0(x_0, y_0)$ has the form:
\[ y y_0 = p(x + x_0). \tag{11.16} \]


Optical property of parabola: the light beams originating from the focus, after 
mirror reflection from the parabola, will be directed parallel to its axis of symmetry. 
Note that this property of parabola underlies the arrangement of parabolic mirrors 
and parabolic antennas. 


11.4 Degenerate Curves 


Among the degenerate curves of the second order are the curves whose canonical 
form is different from the equation of ellipse, parabola or hyperbola. There exist the 
following types of such curves: imaginary ellipse, pair of intersecting lines, pair of 
imaginary intersecting lines, pair of parallel lines, pair of imaginary parallel lines 
and pair of coincident lines. Thus, nondegenerate curves of the second order are 
ellipse, hyperbola and parabola (and they alone). 


11.4.1 Imaginary Ellipse

Imaginary ellipse is described in the canonical system of coordinates by the equation
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = -1, \tag{11.17} \]
where $a \geqslant b > 0$.
Since the sum of squares of real numbers cannot be equal to a negative number, this curve contains no points. The term "imaginary ellipse" is associated with the fact that when changing the variables $x' = ix$, $y' = iy$, where $i = \sqrt{-1}$, in the new coordinates we obtain the equation of ellipse (11.2).

11.4.2 Pair of Intersecting Lines

The equation of the form
\[ \frac{x^2}{a^2} - \frac{y^2}{b^2} = 0 \tag{11.18} \]
corresponds to the pair of intersecting lines. Their intersection point is the origin of coordinates.
Having written the Eq. (11.18) in the form
\[ \left(\frac{x}{a} - \frac{y}{b}\right)\left(\frac{x}{a} + \frac{y}{b}\right) = 0, \tag{11.19} \]
we conclude that it is satisfied by all points of two lines $y = bx/a$ and $y = -bx/a$ intersecting at the origin of coordinates.


11.4.3 Pair of Imaginary Intersecting Lines

Pair of imaginary intersecting lines is described by the equation
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 0. \tag{11.20} \]
The sum of squares is equal to zero if and only if each of the summands is equal to zero: $\frac{x^2}{a^2} = 0$ and $\frac{y^2}{b^2} = 0$, i.e. $x = y = 0$. This condition specifies the only point that coincides with the origin of coordinates.


11.4.4 Pair of Parallel Lines


The equation
\[ x^2 - a^2 = 0, \quad \text{where } a \neq 0, \tag{11.21} \]
defines two parallel vertical lines $x = a$ and $x = -a$.

11.4.5 Pair of Imaginary Parallel Lines


The curve of the second order, specified in the canonical system of coordinates by the equation
\[ x^2 + a^2 = 0 \tag{11.22} \]
under condition $a \neq 0$, contains no real points.


11.4.6 Pair of Coincident Lines


And the final type of the general equation of curve of the second order is
\[ x^2 = 0. \tag{11.23} \]
This equation is satisfied by the coordinates of all points lying on the ordinate axis.

As a result, based on the canonical form of curves of the second order, they are 
subdivided into nine above classes, the most practically important being ellipse, 
hyperbola and parabola. 


11.5 Algorithms for Computing the Coordinates of the Tangent Points of Second Order Curve and the Straight Line


Consider a nondegenerate curve of the second order, for example, the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$, and an arbitrary point of the Cartesian plane $P_1(x_1, y_1)$. Let us provide the algorithm that computes the coordinates of the tangency points of the said curve and the lines passing through $P_1$ (see Fig. 11.6).

We will begin with deducing the analytical relations for the coordinates of the sought points. Denote the tangency points by $T_1(x_{t1}, y_{t1})$ and $T_2(x_{t2}, y_{t2})$. In Chap. 7 "Equation of a Straight Line on a Plane" it is shown that the equation of the family of lines passing through the set point $P_1(x_1, y_1)$ can be written in the form $y - y_1 = k(x - x_1)$, where $|k| < \infty$ is the slope. The case of a vertical tangent will be discussed separately.

Fig. 11.6 Tangents to the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$, passing through a point $P_1$

It is known that the tangent to the ellipse intersects it exactly at one point (see page 359). Therefore, the unknown slope $k$ takes such a value that the nonlinear system of equations
\[ \begin{cases} y - y_1 = k(x - x_1), \\[2pt] \dfrac{x^2}{a^2} + \dfrac{y^2}{b^2} = 1 \end{cases} \tag{11.24} \]
has a unique solution. Having substituted the first equation of this system into the second one, we obtain the quadratic equation
\[ (a^2k^2 + b^2)x^2 - 2a^2k(kx_1 - y_1)x + a^2(kx_1 - y_1)^2 - a^2b^2 = 0. \tag{11.25} \]
The necessary and sufficient condition for the quadratic equation to have the only root is the discriminant's equality to zero
\[
\begin{aligned}
D &= a^4k^2(kx_1 - y_1)^2 + a^2(a^2k^2 + b^2)\bigl(b^2 - (kx_1 - y_1)^2\bigr) \\
&= a^2b^2\bigl(a^2k^2 + b^2 - (kx_1 - y_1)^2\bigr) = 0.
\end{aligned} \tag{11.26}
\]
Solution of the equation $D = 0$ relative to the variable $k$ allows writing the values of the slope:
\[ k = \frac{x_1y_1 \pm \sqrt{b^2x_1^2 + a^2y_1^2 - a^2b^2}}{x_1^2 - a^2}. \tag{11.27} \]
Further, let us consider three cases depending on the mutual arrangement of the point $P_1$ and the ellipse.
1. The point $P_1$ lies inside the ellipse, i.e. $\frac{x_1^2}{a^2} + \frac{y_1^2}{b^2} < 1$. The radicand $x_1^2b^2 + y_1^2a^2 - a^2b^2$ in (11.27) is negative, and there are no real $k$ in this case. Therefore, it is impossible to draw a tangent through the point $P_1$, which is situated inside the ellipse.

2. The point $P_1$ lies on the ellipse, i.e. $\frac{x_1^2}{a^2} + \frac{y_1^2}{b^2} = 1$. Then the Eq. (11.26) has the only root $k = \frac{x_1y_1}{x_1^2 - a^2}$, and $P_1$ is a tangency point.

3. The point $P_1$ lies outside the ellipse, $\frac{x_1^2}{a^2} + \frac{y_1^2}{b^2} > 1$. In this case, there are two possible values of $k$ that correspond to two tangents. We will find the coordinates of the tangency points by substituting (11.27) into the Eq. (11.25):
\[ x_{t1,2} = \frac{a^2k(kx_1 - y_1)}{b^2 + a^2k^2}, \quad y_{t1,2} = -\frac{b^2(kx_1 - y_1)}{b^2 + a^2k^2}. \tag{11.28} \]

Now, let us consider a special case when $x_1 = \pm a$, and one of the tangents is vertical (see Fig. 11.7).

In this case, one of the tangency points has the coordinates $(a, 0)$ or $(-a, 0)$, and the second one is computed based on the equation $D = 0$, which leads to the condition $k = \pm\frac{y_1^2 - b^2}{2ay_1}$ and, consequently, to the coordinates $\left(\pm a\,\frac{b^2 - y_1^2}{b^2 + y_1^2},\ \frac{2b^2y_1}{b^2 + y_1^2}\right)$ if $y_1 \neq 0$, and to the equation $x = \pm a$ if $y_1 = 0$.

As a result, we formulate the algorithm for finding the coordinates of the tangency points of the straight line and the ellipse.


1. If $\frac{x_1^2}{a^2} + \frac{y_1^2}{b^2} < 1$, then there are no tangency points.

2. If $\frac{x_1^2}{a^2} + \frac{y_1^2}{b^2} = 1$, then there is the only tangency point $(x_1, y_1)$.

3. If $x_1 = \pm a$, then there are two tangency points
\[ (x_1, 0) \quad \text{and} \quad \left(\pm a\,\frac{b^2 - y_1^2}{b^2 + y_1^2},\ \frac{2b^2y_1}{b^2 + y_1^2}\right), \]
where the sign coincides with the sign of $x_1$.

Fig. 11.7 Tangents to the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$, when one of them is vertical

In all other cases, $k_1$ and $k_2$ are calculated by the formulae:
\[ k_1 = \frac{x_1y_1 + \sqrt{b^2x_1^2 + a^2y_1^2 - a^2b^2}}{x_1^2 - a^2}, \quad k_2 = \frac{x_1y_1 - \sqrt{b^2x_1^2 + a^2y_1^2 - a^2b^2}}{x_1^2 - a^2}, \tag{11.29} \]
and the sought tangency points will have the coordinates:
\[ \left(\frac{a^2k_1(k_1x_1 - y_1)}{b^2 + a^2k_1^2},\ -\frac{b^2(k_1x_1 - y_1)}{b^2 + a^2k_1^2}\right), \quad \left(\frac{a^2k_2(k_2x_1 - y_1)}{b^2 + a^2k_2^2},\ -\frac{b^2(k_2x_1 - y_1)}{b^2 + a^2k_2^2}\right). \tag{11.30} \]


Example 11.2 Let us apply the above algorithm to the values of the parameters $a = 8$, $b = 5$ and $P_1(6, 5)$. We obtain the coordinates of the two points of tangency: $T_1\left(\frac{192}{25}, \frac{7}{5}\right)$ and $T_2(0, 5)$.
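The algorithm of this section can be sketched in Python as follows. This is an illustration under stated assumptions rather than a reference implementation: the function name `tangency_points` and the tolerance parameter `tol` (needed because the "inside", "on" and "outside" tests are made in floating-point arithmetic) are hypothetical, and the ordinate of a tangency point is computed as $-b^2(kx_1 - y_1)/(b^2 + a^2k^2)$, the sign that reproduces the points of Example 11.2.

```python
import math


def tangency_points(a, b, x1, y1, tol=1e-12):
    """Tangency points of the lines through (x1, y1) with x^2/a^2 + y^2/b^2 = 1."""
    s = x1**2 / a**2 + y1**2 / b**2
    if s < 1 - tol:
        return []                       # P1 is inside: no tangents
    if abs(s - 1) <= tol:
        return [(x1, y1)]               # P1 is on the ellipse
    if abs(abs(x1) - a) <= tol:         # one of the tangents is vertical
        d = b**2 + y1**2
        return [(x1, 0.0), (x1 * (b**2 - y1**2) / d, 2 * b**2 * y1 / d)]
    root = math.sqrt(b**2 * x1**2 + a**2 * y1**2 - a**2 * b**2)
    points = []
    for k in ((x1 * y1 + root) / (x1**2 - a**2),
              (x1 * y1 - root) / (x1**2 - a**2)):
        m = k * x1 - y1
        d = b**2 + a**2 * k**2
        points.append((a**2 * k * m / d, -b**2 * m / d))
    return points


# Data of Example 11.2: a = 8, b = 5, P1(6, 5).
for x, y in tangency_points(8, 5, 6, 5):
    print(round(x, 6), round(y, 6))
```

For the data of Example 11.2 the function returns points close to $(192/25, 7/5)$ and $(0, 5)$.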


Review Questions 


1. Write the general equation of a curve of the second order.

2. What curve is called an ellipse?

3. What is the eccentricity of an ellipse?

4. Define directrices of an ellipse.

5. Tell about the optical property of an ellipse.

6. What curve is called a hyperbola?

7. What is the eccentricity of a hyperbola?

8. Define directrices of a hyperbola.

9. Tell about the optical property of a hyperbola.

10. What curve is called a parabola?

11. What is the eccentricity of a parabola?

12. How is the directrix of a parabola defined?

13. Tell about the optical property of a parabola.

14. Write the equation of a tangent to ellipse, hyperbola and parabola.

15. What curves of the second order are referred to as degenerate ones?
Problems

11.1. For the ellipse $\frac{x^2}{18} + \frac{y^2}{9} = 1$, compute the eccentricity and write the equations of the directrices.

11.2. Find the shortest distance from the ellipse $\frac{x^2}{4} + \frac{y^2}{3} = 1$ to the line:


(1) $x + 2y - 5 = 0$;
(2) $2x + y - 5 = 0$.


11.3. The ellipse $x^2 + 4y^2 = 5$ is given. Find the equation of the line that is tangent to this ellipse at the point with the coordinates $x = 1$, $y = -1$.

11.4. Find the angle between the tangents drawn from the point $(-8, 2)$ to the ellipse $\frac{x^2}{24} + \frac{y^2}{12} = 1$.

*11.5. From what points of the Cartesian coordinate plane is the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$ seen at a right angle?

11.6. The semiaxes of an equilateral hyperbola are equal: $a = b$. Find the eccentricity of the equilateral hyperbola.

11.7. Find the angle between the asymptotes to the hyperbola $\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1$.

11.8. Find the distance from the point $(4, 0)$ to the curve $y^2 - 2x = 0$.

11.9. Write the equation of the tangent to the parabola $y^2 = 5x$ at the point nearest to the point $M_0(2, 1/2)$.

11.10. The representation of hyperbola in parametric form (11.13) has a disadvantage consisting in that its left and right branches are described by different expressions with different signs. Find the parametric specification of hyperbola that does not have the said disadvantage.

*11.11. Assume that it is known that the line $Ax + By + C = 0$ is tangent to the ellipse whose focal distance is equal to $2c$. Set up the equation of this ellipse.
x2 y? 
11.12. Find at what points the line —3x + 3y — 2 = 0 and the ellipse cs + a 1 
intersect. 
11.13. At what points do the line $x - 3y - 2 + 3\sqrt{3} = 0$ and the hyperbola $x^2 - y^2 = 1$ intersect?


11.14. At what points do the line x + y — 5/5 = 0 and the parabola y* = 6x 
intersect? 

11.15. The eccentricity of Mercury’s orbit is equal to 0.2; the major semiaxis is 
equal to 0.39 astronomical units (a. u.). Compute the greatest and the least 
distances of the planet from the Sun. 

11.16. The eccentricity of the Earth’s orbit is equal to 0.017; the major semiaxis is 
equal to one astronomical unit. What are the greatest and the least distances 
from the Earth to the Sun? 
Fig. 11.8 The region bounded by the ellipse and the right branch of the hyperbola. The curves intersect at the point $M(x_M, y_M)$; the marks $-a$ and $a$ are shown on the $x$-axis

11.17. Assume that it is known that the eccentricity of the ellipse is $\varepsilon = 1/2$. In the canonical system of coordinates, one of its directrices is specified by the equation $x = 12$. Compute the distance in this coordinate system from the point $M_1$ of the ellipse with the abscissa equal to $-1$ to the focus unilateral with this directrix.

11.18. Assume that it is known that the eccentricity of the hyperbola is $\varepsilon = 3/2$. In the canonical system of coordinates, one of its directrices is specified by the equation $x = -4$. Compute the distance from the point $N$ of the hyperbola with the ordinate equal to 9 to the focus unilateral with this directrix.

*11.19. Find the area of the region bounded by the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$ and the right branch of the hyperbola $\frac{x^2}{\alpha^2} - \frac{y^2}{\beta^2} = 1$ (see Fig. 11.8, where the said region is colour-highlighted).
11.1 Solution.
The semiaxes of the ellipse are equal to $a = \sqrt{18}$ and $b = 3$. Compute the eccentricity of the ellipse:

$$\varepsilon = \frac{c}{a} = \frac{\sqrt{a^2 - b^2}}{a} = \frac{\sqrt{18 - 9}}{\sqrt{18}} = \frac{1}{\sqrt{2}}.$$


Write the equations of directrices: 
11.2 Solution.

(1) Find the equations of two lines tangent to the ellipse and parallel to the line $x + 2y - 5 = 0$. For this, write the equation of the tangent to the ellipse:

$$\frac{x_0 x}{a^2} + \frac{y_0 y}{b^2} = 1,$$

where $(x_0, y_0)$ is the point of tangency, $a^2 = 4$, $b^2 = 3$. Thus, the equation of the tangent takes the form:

$$y = -\frac{3x_0}{4y_0}\,x + \frac{3}{y_0}.$$

The slope of the line $x + 2y - 5 = 0$ is equal to $k = -\frac{1}{2}$. As is known, the slopes of parallel lines coincide. Therefore, the following equality is valid: $-\frac{3x_0}{4y_0} = -\frac{1}{2}$. Moreover, the point $(x_0, y_0)$ belongs to the ellipse. As a result, we obtain the system of equations:

$$\begin{cases} \dfrac{3x_0}{4y_0} = \dfrac{1}{2}, \\[2mm] \dfrac{x_0^2}{4} + \dfrac{y_0^2}{3} = 1. \end{cases}$$

The set of its solutions has the form $\{(1, 3/2),\ (-1, -3/2)\}$.
We obtain the equations of the tangents:

$$x + 2y - 4 = 0, \qquad x + 2y + 4 = 0.$$

Note that the first of them is located closer to the initial line, since the following inequality is valid: $\operatorname{abs}((-4) - (-5)) < \operatorname{abs}(4 - (-5))$ (see Problem 7.13).

It only remains to find the distance from the point $(1, 3/2)$ to the line $x + 2y - 5 = 0$.

Let us use the formula (7.37):

$$d = \operatorname{abs}(\delta) = \frac{\operatorname{abs}(Ax_0 + By_0 + C)}{\sqrt{A^2 + B^2}} = \frac{\operatorname{abs}\left(1 \cdot 1 + 2 \cdot \frac{3}{2} - 5\right)}{\sqrt{1^2 + 2^2}} = \frac{\sqrt{5}}{5}.$$

So, the shortest distance from the ellipse $\frac{x^2}{4} + \frac{y^2}{3} = 1$ to the line $x + 2y - 5 = 0$ is equal to $\frac{\sqrt{5}}{5}$.
(2) Find the equations of two lines tangent to the given ellipse and parallel to the line $2x + y - 5 = 0$. Similarly to the previous item of this problem, we obtain the system of equations relative to the unknown coordinates of the tangency point $x_0$ and $y_0$:

$$\begin{cases} \dfrac{3x_0}{4y_0} = 2, \\[2mm] \dfrac{x_0^2}{4} + \dfrac{y_0^2}{3} = 1. \end{cases}$$

We obtain the set of solutions $\left\{\left(\frac{8}{\sqrt{19}}, \frac{3}{\sqrt{19}}\right),\ \left(-\frac{8}{\sqrt{19}}, -\frac{3}{\sqrt{19}}\right)\right\}$. Find the equations of the tangents:

$$y = -2x + \sqrt{19}, \qquad y = -2x - \sqrt{19}.$$

Note that the first of them is closer to the initial line, since the following inequality is valid: $\operatorname{abs}(-\sqrt{19} - (-5)) < \operatorname{abs}(\sqrt{19} - (-5))$ (see Problem 7.13).

The distance from the point $\left(\frac{8}{\sqrt{19}}, \frac{3}{\sqrt{19}}\right)$ to the line $2x + y - 5 = 0$ will be found by the formula (7.37):

$$d = \frac{\operatorname{abs}(Ax_0 + By_0 + C)}{\sqrt{A^2 + B^2}} = \frac{\operatorname{abs}\left(2 \cdot \frac{8}{\sqrt{19}} + \frac{3}{\sqrt{19}} - 5\right)}{\sqrt{2^2 + 1^2}} = \frac{5\sqrt{5} - \sqrt{95}}{5}.$$

As a result, the shortest distance from the ellipse $\frac{x^2}{4} + \frac{y^2}{3} = 1$ to the line $2x + y - 5 = 0$ is equal to $\sqrt{5} - \frac{1}{5}\sqrt{95}$.
11.3 Solution.
Use the equation of the tangent to the ellipse (11.4):

$$\frac{x x_0}{a^2} + \frac{y y_0}{b^2} = 1.$$

In our case, $a^2 = 5$, $b^2 = \frac{5}{4}$, $x_0 = 1$, $y_0 = -1$, and the equation of the tangent takes the form $\frac{x \cdot 1}{5} + \frac{y \cdot (-1)}{5/4} = 1$, or $x - 4y - 5 = 0$.


11.4 Solution.
According to the problem statement, both tangents to the ellipse pass through the point $(x_1, y_1) = (-8, 2)$.
As is shown in Sect. 11.5 (formula (11.27)), the slopes of the tangents $k_1$ and $k_2$ are computed by the formula

$$k_{1,2} = \frac{x_1 y_1 \pm \sqrt{b^2 x_1^2 + a^2 y_1^2 - a^2 b^2}}{x_1^2 - a^2}.$$

We obtain $k_1 = 1/5$, $k_2 = -1$.
Based on the formula (7.17), we find the angle between the lines:

$$\varphi = \arctan\frac{k_1 - k_2}{1 + k_1 k_2} = \arctan\frac{1/5 - (-1)}{1 + (1/5)(-1)} = \arctan\frac{3}{2}.$$

Therefore, the angle between the tangents to the ellipse is equal to $\varphi = \arctan\frac{3}{2}$.
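The computation in Solution 11.4 can be checked numerically. The following minimal sketch assumes the ellipse parameters $a^2 = 24$, $b^2 = 12$ (the values consistent with the slopes quoted above); the function names `tangent_slopes` and `angle_between` are illustrative and do not come from the text.

```python
import math


def tangent_slopes(x1, y1, a2, b2):
    """Slopes of the two tangents drawn from the external point (x1, y1)
    to the ellipse x^2/a2 + y^2/b2 = 1, as in formula (11.27).
    Assumes x1^2 != a2, i.e. neither tangent is vertical."""
    root = math.sqrt(b2 * x1**2 + a2 * y1**2 - a2 * b2)
    denom = x1**2 - a2
    return (x1 * y1 + root) / denom, (x1 * y1 - root) / denom


def angle_between(k1, k2):
    """Acute angle between two lines with slopes k1 and k2, as in formula (7.17)."""
    return math.atan(abs((k1 - k2) / (1 + k1 * k2)))


k1, k2 = tangent_slopes(-8, 2, 24, 12)
phi = angle_between(k1, k2)
print(k1, k2, math.tan(phi))
```

The printed tangent of the angle should agree with the value $3/2$ obtained analytically, up to floating-point rounding.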


11.5 Solution.

Since the ellipse is seen at a right angle from some point $M(x_1, y_1)$, the angle between the tangents drawn from $M$ to the ellipse is equal to $\pi/2$. Based on the formula (11.27) on page 366, the slopes of the tangents $k_1$ and $k_2$ are equal to

$$k_{1,2} = \frac{x_1 y_1 \pm \sqrt{b^2 x_1^2 + a^2 y_1^2 - a^2 b^2}}{x_1^2 - a^2}.$$

Compute the angle $\varphi$ between the tangents using the equation (7.17):

$$\varphi = \arctan\frac{k_1 - k_2}{1 + k_1 k_2} = \arctan\frac{2\sqrt{b^2 x_1^2 + a^2 y_1^2 - a^2 b^2}}{x_1^2 + y_1^2 - a^2 - b^2}.$$

It follows from the obtained equation that the angle $\varphi = \pi/2$ if the denominator of the fraction under the arctangent is equal to zero: $x_1^2 + y_1^2 - a^2 - b^2 = 0$.
As a result, the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$ is seen at a right angle from all points of the plane that satisfy the equation

$$x^2 + y^2 = a^2 + b^2.$$

Such points, as is easy to see, form a circle of radius $\sqrt{a^2 + b^2}$.
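The result of Solution 11.5 can be illustrated numerically: for a point on the circle $x^2 + y^2 = a^2 + b^2$ the product of the tangent slopes should equal $-1$. This is only a sketch; the sample semiaxes $a = 3$, $b = 2$ and the helper names are assumptions made for the illustration.

```python
import math


def tangent_slopes(x1, y1, a, b):
    """Slopes of the tangents from (x1, y1) to x^2/a^2 + y^2/b^2 = 1 (formula (11.27)).
    Assumes x1^2 != a^2."""
    root = math.sqrt(b * b * x1 * x1 + a * a * y1 * y1 - a * a * b * b)
    denom = x1 * x1 - a * a
    return (x1 * y1 + root) / denom, (x1 * y1 - root) / denom


def seen_at_right_angle(x1, y1, a, b, tol=1e-9):
    """True if the two tangents from (x1, y1) are perpendicular (k1 * k2 = -1)."""
    k1, k2 = tangent_slopes(x1, y1, a, b)
    return abs(k1 * k2 + 1.0) < tol


a, b = 3.0, 2.0                       # sample semiaxes (assumed for illustration)
radius = math.sqrt(a * a + b * b)     # radius of the circle found in the solution
on_circle = (radius * math.cos(0.7), radius * math.sin(0.7))
off_circle = (5.0, 5.0)

print(seen_at_right_angle(*on_circle, a, b))
print(seen_at_right_angle(*off_circle, a, b))
```

The first point lies on the circle and the second does not, so only the first one should pass the perpendicularity test.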
11.6 Solution.
The eccentricity of the hyperbola is equal to $\varepsilon = \frac{c}{a}$, where $c = \sqrt{b^2 + a^2}$ is half of the interfocal distance, and $a$, $b$ are the semiaxes of the hyperbola.
If $a = b$, then

$$\varepsilon = \frac{\sqrt{b^2 + a^2}}{a} = \frac{\sqrt{a^2 + a^2}}{a} = \sqrt{2}.$$

Thus, we obtain that the eccentricity of the equilateral hyperbola is equal to $\sqrt{2}$.


11.7 Solution.
The asymptotes of the hyperbola $\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1$, as is known (see page 361), are defined by the equations $y = \pm\frac{b}{a}x$. The slopes of these lines are equal to $\pm\frac{b}{a}$.
Therefore, the tangent of half of the angle $\alpha$ between the asymptotes is equal to $\tan(\alpha/2) = \frac{b}{a}$. Let us express the sought angle from the obtained relation:

$$\alpha = 2\arctan\frac{b}{a}.$$

Note that for the equilateral hyperbola (see Problem 11.6), the angle between the asymptotes is equal to $\pi/2$.


11.8 Solution.
In order to find the distance from the point $(x_0, y_0)$ to the curve $y^2 - 2x = 0$, find the least value of the functions

$$d_1(x) = \sqrt{(x - x_0)^2 + (f_1(x) - y_0)^2} \quad\text{and}\quad d_2(x) = \sqrt{(x - x_0)^2 + (f_2(x) - y_0)^2},$$

where $x_0 = 4$, $y_0 = 0$, and $f_{1,2}(x) = \pm\sqrt{2x}$ denote the two branches of the parabola $y^2 - 2x = 0$.
First, consider the function $d_1(x)$: $d_1(x) = \sqrt{(x - 4)^2 + 2x}$. Its derivative is equal to

$$d_1'(x) = \frac{x - 3}{\sqrt{(x - 4)^2 + 2x}}.$$

The point suspected of being the extremum is the solution of the equation $d_1'(x) = 0$, or $x - 3 = 0$. This is the minimum point.
Therefore, $\min d_1(x) = \sqrt{(3 - 4)^2 + 2 \cdot 3} = \sqrt{7}$.

Similarly, we find the least value of the function $d_2(x)$: $\min d_2(x) = \sqrt{7}$.

As a result, we obtain that the distance from the point $(4, 0)$ to the parabola is equal to $\sqrt{7}$.


11.9 Answer:
$5x - 2\sqrt{10}\,y + 10 = 0$.
11.10 Answer:

$$\begin{cases} x = \dfrac{a}{\cos t}, \\[2mm] y = b\tan t, \end{cases}$$

where $t \in (0, 2\pi)$.


11.11 Solution.
Denote the tangency point by $P(x_0, y_0)$. The equation of the tangent to the ellipse at this point has the form:

$$\frac{x x_0}{a^2} + \frac{y y_0}{b^2} = 1,$$

where $a$ and $b$ are the major and minor semiaxes of the ellipse, respectively. On the other hand, according to the problem statement, the general equation of the line has the form $Ax + By + C = 0$, and it can be rewritten as

$$\left(-\frac{A}{C}\right)x + \left(-\frac{B}{C}\right)y = 1.$$

Comparing the obtained equations, we obtain $\frac{x_0}{a^2} = -\frac{A}{C}$, $\frac{y_0}{b^2} = -\frac{B}{C}$; therefore, the coordinates of the tangency point $P$ are equal to $(-Aa^2/C, -Bb^2/C)$. It is clear that the coordinates of the point $P$ satisfy the equation of the ellipse

$$\frac{x_0^2}{a^2} + \frac{y_0^2}{b^2} = 1.$$

Moreover, from the definition of focus follows the equality $a^2 - b^2 = c^2$, where $c$ is the focal parameter. Hence, we obtain the system of equations relative to the parameters $a$, $b$:

$$\begin{cases} \dfrac{1}{a^2}\left(\dfrac{Aa^2}{C}\right)^2 + \dfrac{1}{b^2}\left(\dfrac{Bb^2}{C}\right)^2 = 1, \\[2mm] a^2 - b^2 = c^2, \end{cases}$$

or:

$$\begin{cases} a^2A^2 + b^2B^2 = C^2, \\ a^2 - b^2 = c^2. \end{cases}$$

The solution of this system is: $a = \left(\dfrac{C^2 + c^2B^2}{A^2 + B^2}\right)^{1/2}$, $b = \left(\dfrac{C^2 - c^2A^2}{A^2 + B^2}\right)^{1/2}$.
Thus, the sought equation of the ellipse has the form:

$$\frac{A^2 + B^2}{C^2 + c^2B^2}\,x^2 + \frac{A^2 + B^2}{C^2 - c^2A^2}\,y^2 = 1.$$
11.12 Solution.

Express $y$ from the equation of the line: $y = x + \frac{2}{3}$.

Substitute $y$ into the equation of the ellipse:

$$\frac{x^2}{2} + \frac{1}{4}\left(x^2 + \frac{4}{3}x + \frac{4}{9}\right) = 1,$$

or $27x^2 + 12x - 32 = 0$; hence, we obtain $x_1 = \frac{8}{9}$, $y_1 = \frac{14}{9}$, $x_2 = -\frac{4}{3}$, $y_2 = -\frac{2}{3}$.

So, the intersection points of the line and the ellipse are $(8/9, 14/9)$, $(-4/3, -2/3)$.
11.13 Solution.

Express the variable $x$: $x = 3y + 2 - 3\sqrt{3}$.

In order to find the points at which the line and the hyperbola intersect, substitute the obtained expression for $x$ into the equation of the hyperbola:

$$9y^2 + 6y(2 - 3\sqrt{3}) + 31 - 12\sqrt{3} - y^2 - 1 = 0,$$

or

$$8y^2 + y(12 - 18\sqrt{3}) + 30 - 12\sqrt{3} = 0,$$

hence: $y_1 = \dfrac{5\sqrt{3} - 6}{4}$, $x_1 = \dfrac{3\sqrt{3} - 10}{4}$, $y_2 = \sqrt{3}$, $x_2 = 2$.
Thus, the sought intersection points are $\left(\dfrac{3\sqrt{3} - 10}{4}, \dfrac{5\sqrt{3} - 6}{4}\right)$ and $(2, \sqrt{3})$.


11.14 Solution.
In order to find the intersection points of the line and the parabola, solve the system of equations:

$$\begin{cases} x + y - 5\sqrt{5} = 0, \\ y^2 = 6x \end{cases} \;\Rightarrow\; \begin{cases} x = -y + 5\sqrt{5}, \\ y^2 = -6y + 30\sqrt{5} \end{cases} \;\Rightarrow\; y^2 + 6y - 30\sqrt{5} = 0.$$

The obtained quadratic equation has the roots

$$y_{1,2} = -3 \pm \sqrt{9 + 30\sqrt{5}}.$$

Substitute the values $y_{1,2}$ into the equation of the straight line $x + y - 5\sqrt{5} = 0$. We obtain

$$x_{1,2} = 3 \mp \sqrt{9 + 30\sqrt{5}} + 5\sqrt{5}.$$

Thus, the coordinates of the intersection points are:

$$M_1\left(3 - \sqrt{9 + 30\sqrt{5}} + 5\sqrt{5},\ -3 + \sqrt{9 + 30\sqrt{5}}\right),$$
$$M_2\left(3 + \sqrt{9 + 30\sqrt{5}} + 5\sqrt{5},\ -3 - \sqrt{9 + 30\sqrt{5}}\right).$$


11.15 Solution.

According to the first law of Kepler,¹ all planets of the Solar system move along an ellipse, in one of the foci of which the Sun is situated [10]. The point $P$ of the orbit closest to the Sun is referred to as the perihelion, and the point $A$ of the orbit farthest from the Sun is referred to as the aphelion.

In order to compute these values for the planet Mercury, introduce the notations: $F_1(c, 0)$ are the coordinates of the Sun, $P(a, 0)$ are the coordinates of the perihelion, $A(-a, 0)$ are the coordinates of the aphelion and $O(0, 0)$ is the centre of symmetry of the ellipse. Here, $a$ is the major semiaxis, and $c$ is the focal parameter.

Then:

$r_a = AF_1$ is the greatest distance from Mercury to the Sun,
$r_p = PF_1$ is the least distance from Mercury to the Sun,
$c = OF_1$.

Geometrically, we obtain

$$\begin{cases} a = r_p + c, \\ a = r_a - c \end{cases} \;\Rightarrow\; \begin{cases} r_p = a - c, \\ r_a = a + c. \end{cases}$$

The eccentricity of the ellipse $\varepsilon$ will be computed by the formula $\varepsilon = \frac{c}{a}$, where $a^2 - b^2 = c^2$.
Hence,

$$\begin{cases} r_p = a - a\varepsilon, \\ r_a = a + a\varepsilon \end{cases} \;\Rightarrow\; \begin{cases} r_p = a(1 - \varepsilon), \\ r_a = a(1 + \varepsilon). \end{cases}$$

Substitute the numeric values from the problem statement:

$$r_p = 0.39 \cdot (1 - 0.2) \;\Rightarrow\; r_p = 0.312 \text{ (a.u.)},$$
$$r_a = 0.39 \cdot (1 + 0.2) \;\Rightarrow\; r_a = 0.468 \text{ (a.u.)}.$$

So, the greatest distance from Mercury to the Sun is equal to 0.468 a.u., and the least distance is equal to 0.312 a.u.

¹ Johannes Kepler (1571-1630), German mathematician and astronomer.


11.16 Solution.

Denote the point $F_1(c, 0)$ as the coordinates of the Sun, $P(a, 0)$ as the coordinates of the perihelion, $A(-a, 0)$ as the coordinates of the aphelion, and $O(0, 0)$ as the centre of symmetry of the ellipse. Here, $a$ is the major semiaxis, and $c$ is the focal parameter of the Earth's orbit. Then (see the previous problem):

$$\begin{cases} r_a = AF_1, \\ r_p = PF_1, \\ c = OF_1 \end{cases} \;\Rightarrow\; \begin{cases} r_p = a(1 - \varepsilon), \\ r_a = a(1 + \varepsilon), \end{cases}$$

where $\varepsilon = 0.017$ is the eccentricity of the Earth's orbit, $a = 1$ a.u.
Perform the necessary computations:

$$r_p = 1 \cdot (1 - 0.017) \;\Rightarrow\; r_p = 0.983 \text{ (a.u.)},$$
$$r_a = 1 \cdot (1 + 0.017) \;\Rightarrow\; r_a = 1.017 \text{ (a.u.)}.$$

As a result, we obtain that the greatest distance from the Earth to the Sun is equal to 1.017 a.u., and the least distance is equal to 0.983 a.u.
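The relations $r_p = a(1 - \varepsilon)$ and $r_a = a(1 + \varepsilon)$ used in Solutions 11.15 and 11.16 are easy to evaluate programmatically. The sketch below is illustrative; the function name `extreme_distances` is an assumption, and the input values are those given in the problem statements.

```python
def extreme_distances(a, eps):
    """Return (r_p, r_a): the least and the greatest distances from the focus
    for an elliptic orbit with major semiaxis a and eccentricity eps."""
    return a * (1 - eps), a * (1 + eps)


mercury = extreme_distances(0.39, 0.2)    # values from Problem 11.15
earth = extreme_distances(1.0, 0.017)     # values from Problem 11.16
print("Mercury:", mercury)
print("Earth:  ", earth)
```

The distances are expressed in the same units as the semiaxis, here astronomical units.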
11.17 Solution.

The point $M_1$ has the coordinates $(-1, y_1)$ and is situated on the left of the $Oy$-axis, and the focus is on the right. The sought distance is equal to the length of the hypotenuse $r$ of the triangle with the vertices $F_1(c, 0)$, $M_1(-1, y_1)$ and $L(-1, 0)$.

According to the Pythagorean theorem:

$$r = \sqrt{(c + 1)^2 + y_1^2}.$$

Using the definition of the directrix of the ellipse $x = \pm\frac{a}{\varepsilon}$, we find

$$x_d = \frac{a}{\varepsilon} \;\Rightarrow\; a = \varepsilon x_d,$$

where, according to the problem statement, $x_d = 12$.
Express the focal parameter $c$ of the ellipse:

$$c = a\varepsilon = \varepsilon^2 x_d.$$

The coordinate $y_1$ will be determined using the equation of the ellipse $\frac{(-1)^2}{a^2} + \frac{y_1^2}{b^2} = 1$:

$$y_1 = \pm\frac{b}{a}\sqrt{a^2 - 1}.$$

Since for the ellipse the following relation is valid: $b^2 = a^2 - c^2 = a^2(1 - \varepsilon^2)$, then

$$y_1 = \pm\sqrt{(1 - \varepsilon^2)(a^2 - 1)}.$$

Therefore, the length of the hypotenuse $F_1M_1$ of the triangle $F_1LM_1$ is equal to

$$r = \sqrt{(c + 1)^2 + y_1^2} = \sqrt{(a\varepsilon + 1)^2 + (1 - \varepsilon^2)(a^2 - 1)} = a + \varepsilon = \varepsilon(x_d + 1).$$

Substitute here the numeric values from the problem statement, and we obtain the final answer: $r = \frac{13}{2}$.
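The answer of Solution 11.17 can be confirmed by a direct coordinate computation that does not rely on the simplified expression $r = \varepsilon(x_d + 1)$. This is a sketch under the problem's data ($\varepsilon = 1/2$, $x_d = 12$, abscissa $-1$); the function name `focal_distance` is illustrative.

```python
import math


def focal_distance(eps, x_d, x_m):
    """Distance from the ellipse point with abscissa x_m to the focus F1(c, 0)
    lying on the same side as the directrix x = x_d (canonical coordinates)."""
    a = eps * x_d                        # from x_d = a / eps
    c = a * eps                          # focal parameter
    b2 = a * a - c * c                   # b^2 = a^2 - c^2
    y2 = b2 * (1 - x_m * x_m / (a * a))  # y^2 from the ellipse equation
    return math.hypot(x_m - c, math.sqrt(y2))


r = focal_distance(0.5, 12, -1)
print(r)
```

The value agrees with $\varepsilon(x_d + 1) = 13/2$ up to rounding.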


11.18 Solution.
The sought distance is equal to the length of the hypotenuse of the triangle with the vertices $F_2(-c, 0)$, $N_1(x_1, y_1)$ and $L(x_1, 0)$, where, according to the problem statement, $y_1 = 9$. Depending on whether the point $N_1(x_1, y_1)$ is situated on the right or on the left branch of the hyperbola, we obtain two possible values of $r$:

$$r = \sqrt{(x_1 + c)^2 + y_1^2}.$$

According to the definition of the directrix of the hyperbola $x = \pm\frac{a}{\varepsilon}$, we have $a = \varepsilon \operatorname{abs}(x_d)$, where $x_d = -4$.
The parameter $c$ of the hyperbola is equal to

$$c = a\varepsilon = \varepsilon^2 \operatorname{abs}(x_d).$$

Since $\frac{x_1^2}{a^2} - \frac{y_1^2}{b^2} = 1$ and $c^2 = a^2 + b^2$, then

$$r^2 = (x_1 + c)^2 + b^2\left(\frac{x_1^2}{a^2} - 1\right) = (a + \varepsilon x_1)^2 = \varepsilon^2(\operatorname{abs}(x_d) + x_1)^2,$$

where $x_1 = \pm\dfrac{a}{b}\sqrt{y_1^2 + b^2} = \pm\sqrt{\dfrac{y_1^2}{\varepsilon^2 - 1} + \varepsilon^2 x_d^2}$. Having substituted the values from the problem statement, we obtain $r = 9\sqrt{\dfrac{14}{5}} + 6$ or $r = 9\sqrt{\dfrac{14}{5}} - 6$.


11.19 Solution.

Denote the region bounded by the ellipse and the hyperbola by $D$. Due to the symmetry of the ellipse and the hyperbola, the sought area is expressed as the doubled area of the region $D_0$:

$$S(D) = 2S(D_0).$$

In Fig. 11.8 the region $D_0$ is marked with the symbol "x".
The equation of the part of the ellipse lying in the first quadrant has the form

$$y_{\text{ell}} = \frac{b}{a}\sqrt{a^2 - x^2}, \quad\text{where } 0 \le x \le a.$$

The equation of the part of the right branch of the hyperbola that is situated above the abscissa axis is: $y_{\text{hyp}} = \dfrac{\beta}{\alpha}\sqrt{x^2 - \alpha^2}$, where $x \ge \alpha$.

Assume that $x_M$ is the abscissa of the point $M$ of intersection of the curves $y_{\text{ell}}$ and $y_{\text{hyp}}$ that lies in the first quadrant.

Note that when the condition $a > \alpha$ is satisfied, such a point always exists. Otherwise, when $a \le \alpha$, the region $D$ is empty, and $S(D) = 0$.
So, under the condition $a > \alpha$, the following equality is valid:

$$S(D_0) = \int_{\alpha}^{x_M} y_{\text{hyp}}(x)\,dx + \int_{x_M}^{a} y_{\text{ell}}(x)\,dx. \tag{11.31}$$

Compute the coordinate $x_M$:

$$y_{\text{ell}}(x_M) = y_{\text{hyp}}(x_M) \;\Rightarrow\; \frac{b}{a}\sqrt{a^2 - x_M^2} = \frac{\beta}{\alpha}\sqrt{x_M^2 - \alpha^2}.$$

The solution of this equation that satisfies the conditions $x > 0$, $y > 0$ has the form:

$$x_M = a\alpha\sqrt{\frac{b^2 + \beta^2}{a^2\beta^2 + \alpha^2 b^2}}.$$

Substitution of $x_M$ into the formula (11.31) for $S(D_0)$ results in:

$$S(D_0) = \frac{1}{2}\left[ab\arctan\left(\frac{\beta}{\alpha}\sqrt{\frac{a^2 - \alpha^2}{b^2 + \beta^2}}\right) - \alpha\beta\ln\frac{a\sqrt{b^2 + \beta^2} + b\sqrt{a^2 - \alpha^2}}{\sqrt{a^2\beta^2 + \alpha^2 b^2}}\right].$$

Finally, we obtain the area of the region bounded by the ellipse and the right branch of the hyperbola:

$$S(D) = \begin{cases} 0, & \text{if } a \le \alpha, \\[2mm] ab\arctan\left(\dfrac{\beta}{\alpha}\sqrt{\dfrac{a^2 - \alpha^2}{b^2 + \beta^2}}\right) - \alpha\beta\ln\dfrac{a\sqrt{b^2 + \beta^2} + b\sqrt{a^2 - \alpha^2}}{\sqrt{a^2\beta^2 + \alpha^2 b^2}}, & \text{if } a > \alpha. \end{cases}$$
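A closed-form area such as the one in Solution 11.19 is worth cross-checking against direct numerical integration of (11.31). The sketch below assumes the closed form $S(D) = ab\arctan\left(\frac{\beta}{\alpha}\sqrt{\frac{a^2 - \alpha^2}{b^2 + \beta^2}}\right) - \alpha\beta\ln\frac{a\sqrt{b^2 + \beta^2} + b\sqrt{a^2 - \alpha^2}}{\sqrt{a^2\beta^2 + \alpha^2 b^2}}$ for $a > \alpha$, obtained by evaluating the two integrals; the function names and the sample parameters are illustrative assumptions.

```python
import math


def area_formula(a, b, alpha, beta):
    """Closed-form area of the region between the ellipse x^2/a^2 + y^2/b^2 = 1
    and the right branch of the hyperbola x^2/alpha^2 - y^2/beta^2 = 1."""
    if a <= alpha:
        return 0.0
    t = (beta / alpha) * math.sqrt((a * a - alpha * alpha) / (b * b + beta * beta))
    num = a * math.sqrt(b * b + beta * beta) + b * math.sqrt(a * a - alpha * alpha)
    den = math.sqrt(a * a * beta * beta + alpha * alpha * b * b)
    return a * b * math.atan(t) - alpha * beta * math.log(num / den)


def area_numeric(a, b, alpha, beta, n=100_000):
    """Midpoint-rule estimate: on [alpha, a] the region is bounded above by
    min(y_ell, y_hyp) and is symmetric about the x-axis."""
    if a <= alpha:
        return 0.0
    h = (a - alpha) / n
    total = 0.0
    for i in range(n):
        x = alpha + (i + 0.5) * h
        y_ell = (b / a) * math.sqrt(a * a - x * x)
        y_hyp = (beta / alpha) * math.sqrt(x * x - alpha * alpha)
        total += min(y_ell, y_hyp) * h
    return 2.0 * total


exact = area_formula(2.0, 1.0, 1.0, 1.0)     # sample parameters, a > alpha
approx = area_numeric(2.0, 1.0, 1.0, 1.0)
print(exact, approx)
```

For the sample parameters the two values should agree to several decimal places; a disagreement would point to an error in either the formula or the integration limits.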


Chapter 12
Elliptic Curves


An elliptic curve is a plane curve that has no singular points and is defined by an equation of the form

$$y^2 = x^3 + ax + b, \tag{12.1}$$

where $a$ and $b$ are real numbers. The requirement of absence of singular points means that the curve must have neither self-intersections nor cusps.¹ This condition is satisfied if and only if the discriminant of the equation is other than zero:

$$\Delta = -16(4a^3 + 27b^2) \ne 0. \tag{12.2}$$

Of course, the constant factor 16 does not influence the sign of the discriminant; this factor is introduced for convenience in investigating further and deeper properties of the curve.

The name "elliptic curve" goes back to the problem of computing the length of an arc of the ellipse, which leads to the computation of a definite integral of the form

$$\int_{x_1}^{x_2} \frac{R(x)}{\sqrt{x^3 + ax + b}}\,dx \tag{12.3}$$

for some rational function $R(x)$ [7, 32]. The morphological similarity of the terms "ellipse" and "elliptic curve" is due to historic reasons. Let us emphasize that these terms belong to different mathematical concepts.

¹ For an example of self-intersection see Fig. 12.6 on page 403. An example of a cusp is shown in Fig. 12.1e on page 385.
As a sphere of application of elliptic curves, we should mention the proof of Fermat's² Last Theorem and the emergence of a new and rapidly developing scientific field concerned with confidential data transfer methods: elliptic-curve cryptography. There are other known applications of such curves in mathematics and adjacent fields, in particular, in number theory [13, 14].

Depending on the sign of the discriminant, the elliptic curve is formed by one or two connected components:

• if $\Delta > 0$, then the curve graph consists of two connected components;
• if $\Delta < 0$, then the curve graph consists of one connected component.

Note If the conditions $\Delta = 0$ and $a \ne 0$ are satisfied, a point of self-intersection appears on the curve, and if the equalities $a = b = 0$ occur, then a cusp appears.

Example 12.1 Fig. 12.1 shows several elliptic curves for the values of the parameters $a \in \{-4, 0, 4\}$, $b \in \{-2, 0, 2\}$. Note that at $a = b = 0$ we obtain the equation $y^2 = x^3$, for which $\Delta = 0$; therefore, such a curve does not belong to the class of elliptic curves. In Fig. 12.1 it is located on panel (e) and is shown by the dotted line.
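The classification by the sign of the discriminant (12.2) can be sketched in a few lines of Python. The function names `discriminant` and `classify` and the returned labels are illustrative assumptions; integer or exact rational coefficients are assumed so that the comparison with zero is reliable.

```python
def discriminant(a, b):
    """Discriminant (12.2) of the curve y^2 = x^3 + a*x + b."""
    return -16 * (4 * a**3 + 27 * b**2)


def classify(a, b):
    """Describe the curve y^2 = x^3 + a*x + b by the sign of its discriminant."""
    d = discriminant(a, b)
    if d > 0:
        return "elliptic curve, two connected components"
    if d < 0:
        return "elliptic curve, one connected component"
    # d == 0: the curve is singular and is not an elliptic curve.
    # With d == 0, a == 0 forces b == 0, which is the cusp case y^2 = x^3.
    if a == 0:
        return "singular: cusp"
    return "singular: node (the self-intersection case of the Note)"


for a, b in [(-4, 0), (4, 2), (0, 0), (-3, 2)]:
    print((a, b), discriminant(a, b), classify(a, b))
```

The first three parameter pairs correspond to panels of Fig. 12.1; the pair $(-3, 2)$ is an extra sample with $\Delta = 0$ and $a \ne 0$.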


12.1 Operation of Addition of the Elliptic Curve Points


Having fixed two points located on an arbitrary elliptic curve $\Gamma = \{(x, y) \in \mathbb{R}^2 : y^2 = x^3 + ax + b\}$, we can deduce a rule for constructing a third point. Such an operation will be referred to as "addition" of points on an elliptic curve.

In order to perform the operation of addition of points, the Cartesian plane $\mathbb{R}^2$ should be expanded by introducing a point at infinity $\infty$. As will be clear from the further discussion, the point $\infty$ has the properties of a zero element in the set $\mathbb{R}^2 \cup \{\infty\}$ with the operation of addition of points defined on it. Due to this, the designation $O = \infty$ is also used for the point at infinity.

Assume that any elliptic curve passes through the point $O$. We can say that two vertical lines intersect at this point.

So, let the following elliptic curve be specified

$$\Gamma = \{(x, y) \in \mathbb{R}^2 : y^2 = x^3 + ax + b\} \cup \{O\}. \tag{12.4}$$

The main idea of determining the sum $A \oplus B$ of the two points $A = (x_A, y_A)$ and $B = (x_B, y_B)$ on the elliptic curve consists in computing the coordinates of the intersection point of the line $AB$ and $\Gamma$:

$$C' = \{(x_{C'}, y_{C'}) : C' \in AB \text{ and } C' \in \Gamma\} \tag{12.5}$$

² Pierre de Fermat (1601-1665), French mathematician and lawyer.
Fig. 12.1 Elliptic curves for the values of the parameters $a \in \{-4, 0, 4\}$, $b \in \{-2, 0, 2\}$. Panels: (a) $y^2 = x^3 - 4x - 2$, (b) $y^2 = x^3 - 4x$, (c) $y^2 = x^3 - 4x + 2$, (d) $y^2 = x^3 - 2$, (e) $y^2 = x^3$, (f) $y^2 = x^3 + 2$, (g) $y^2 = x^3 + 4x - 2$, (h) $y^2 = x^3 + 4x$, (i) $y^2 = x^3 + 4x + 2$. Rectangular mesh on the panels (a)-(i) has a step equal to 2. The point $(0, 0)$ on panel (e) is the cusp of the curve $y^2 = x^3$
and the subsequent reflection of $C'$ relative to the $Ox$-axis: $C' \to C$. The point $C$ is deemed the sum of the points $A$ and $B$; the remaining part of this section will be devoted to the formal definition of such a summation operation.

In this case, we suppose that

1. the vertical lines of the form $x = \text{const}$ pass through the point at infinity;
2. if the line $AB$ is tangent to the curve $\Gamma$, then the tangency point is taken into account twice.

The point on the elliptic curve opposite to the point $A = (x_A, y_A)$ is the point $-A = (x_A, -y_A)$, resulting from the reflection of the initial point relative to the abscissa axis. As is easy to see, $A \in \Gamma \Rightarrow -A \in \Gamma$, since the algebraic equation that defines $\Gamma$ depends on the variable $y$ quadratically.

A new rule is introduced for the point at infinity: $-\infty = \infty$; in other words, the point $\infty$ is deemed opposite to itself.

Let us proceed to the rule of computing the coordinates of the point $A \oplus B$. For this, consider four cases:

1. addition of a point and $O$;
2. addition of two different points, where $A \ne -B$;
3. addition of the two opposite points $A$ and $-A$;
4. duplication of a point, i.e. computing the sums of the form $A \oplus A$.


12.1.1 Addition of a Point and O 


Let us define the sum of an arbitrary point $A(x_A, y_A)$ of the expanded Cartesian plane and the point $O$ as

$$A \oplus O = O \oplus A = A. \tag{12.6}$$


This means that addition of the point O does not change the values of the initial 
point coordinates. As was already mentioned above, this fact allows deeming the 
point at infinity to be the zero element in the set R? U {oo} with the operation of 
addition defined on it. 


12.1.2 Addition of Two Different Points 


In order to compute the sum $A \oplus B$ on the condition $A \ne -B$, we should find the intersection point of the line $AB$ with the specified elliptic curve $\Gamma$. Denote the respective intersection point by $C'$. Then we find the point $C = -C'$ opposite to $C'$ by reflecting the point $C'$ relative to the $Ox$-axis. Thus, the coordinates of the points $C$ and $C'$ are connected by the relations $x_C = x_{C'}$, $y_C = -y_{C'}$.
Fig. 12.2 Addition of two different points $A$ and $B$ on the elliptic curve


The sum of the points $A$ and $B$ is the point $C$, constructed in the above manner:

$$C = A \oplus B. \tag{12.7}$$

Fig. 12.2 shows a geometric method of constructing the point $A \oplus B$.
Solution of the system

$$\begin{cases} (-y_C)^2 = x_C^3 + a x_C + b, \\[1mm] (-y_C) - y_A = \dfrac{y_B - y_A}{x_B - x_A}\,(x_C - x_A) \end{cases} \tag{12.8}$$

reduces to solving an equation of the third degree with real coefficients. Of course, the roots of such an arbitrary equation can be found using the known Cardano formulae [41]; however, in this case, we can perform easier computations.

Using Viète's formulae (see Problem 4.31), we may say that the sought coordinates of the point $C$ are determined by the formulae:

$$\begin{cases} x_C = \varkappa^2 - x_A - x_B, \\ y_C = -y_A + \varkappa(x_A - x_C), \end{cases} \tag{12.9}$$

where the designation $\varkappa = \dfrac{y_B - y_A}{x_B - x_A}$ is introduced for the slope of the line $AB$.
Fig. 12.3 Addition of two opposite points $A$ and $-A$: $\infty = A \oplus (-A)$


12.1.3 Addition of Two Opposite Points 


Let us proceed to the case when the summands in the sum of the points are opposite 
points, for example, A and —A. 

Draw a line that passes through two opposite points. As can be seen from Fig. 12.3, such a line will be positioned vertically, and the third point that is common to the elliptic curve and the drawn line can only be positioned at infinity. Based on this fact, the sum of the opposite points is defined as $A \oplus (-A) = \infty$. Fig. 12.3 illustrates the formulated rule.


12.1.4 Duplication of a Point 


The computation of the sum of the points of the form $A \oplus A$, i.e. duplication of a point, will require the operation of passage to the limit:

$$A \oplus A = \lim_{B \to A} A \oplus B. \tag{12.10}$$

Geometrically it means that we are drawing a line through two different points $A(x_A, y_A)$ and $B(x_B, y_B)$ of the elliptic curve, and the second point lies in a small neighbourhood of the first one:

$$\lim_{B \to A} A \oplus B = \lim_{\substack{x_B \to x_A, \\ y_B \to y_A}} A \oplus B. \tag{12.11}$$

Fig. 12.4 Duplication of a point $A$. On the panel (a), the case $y_A \ne 0$ is schematically shown; on the panel (b), the case $y_A = 0$


As is seen from Fig. 12.4, this case corresponds to drawing a tangent to the elliptic curve at the point $A$.

Suppose that the tangent intersects the curve at the point $-C$. Note that if the ordinate of the point $A$ is equal to zero, then the said tangent will be positioned vertically. Based on the previous case described in Sect. 12.1.3, we obtain $-C = \infty$, $A \oplus A = A \oplus (-A) = O$. For all other values of the ordinate $y_A \ne 0$, as a result of reflection of the point $-C$ relative to the horizontal axis, the point $C = A \oplus A$ will be constructed.

Let us write the respective analytical relations:

$$\begin{cases} x_C = \varkappa^2 - 2x_A, \\ y_C = -y_A + \varkappa(x_A - x_C), \end{cases} \tag{12.12}$$

where the value $\varkappa = \dfrac{3x_A^2 + a}{2y_A}$ is equal to the tangent of the angle of slope of the tangent line relative to the horizontal axis.

The two cases of point duplication discussed above are illustrated in Fig. 12.4a and b.

Analysis of the formulated addition procedure leads to a criterion for the sum of three points to be equal to $O$.


Theorem 12.1 A sum of three points is equal to O if and only if they lie on the same 
straight line. 


Example 12.2 Compute the sum $(-2, -1) \oplus (0, 1)$ of two points on the elliptic curve defined by the equation $y^2 = x^3 - 4x + 1$.


Solution Denote $A = (-2, -1)$, $B = (0, 1)$ and $C = A \oplus B$. Verify that the points $A$ and $B$ lie on the curve $y^2 = x^3 - 4x + 1$.
Indeed,

$$A:\ (-1)^2 = (-2)^3 - 4 \cdot (-2) + 1, \text{ or } 1 = 1 \text{ is true}, \tag{12.13}$$
$$B:\ 1^2 = 0^3 - 4 \cdot 0 + 1, \text{ or } 1 = 1 \text{ is true}. \tag{12.14}$$

By definition, in order to compute the sum $A \oplus B$ we should find the intersection point of the line $AB$ with the specified elliptic curve, following which we should reflect the intersection point relative to the $Ox$-axis.

Write the equation of the line that passes through two points (see Eq. (7.12) on page 280):

$$y - y_A = \frac{y_B - y_A}{x_B - x_A}\,(x - x_A), \tag{12.15}$$

where $(x_A, y_A)$ and $(x_B, y_B)$ are the coordinates of the initial points.

Having substituted the numeric values of the coordinates into Eq. (12.15), we arrive at the equation of the line $y = x + 1$. The line specified by this equation intersects the elliptic curve $y^2 = x^3 - 4x + 1$ at a point with the abscissa $x_{C'}$, for which the following equality is valid

$$(x_{C'} + 1)^2 = x_{C'}^3 - 4x_{C'} + 1, \tag{12.16}$$

or

$$x_{C'}^3 - x_{C'}^2 - 6x_{C'} = 0. \tag{12.17}$$

As is easy to see, the obtained cubic equation has the roots

$$(x_{C'})_1 = -2, \quad (x_{C'})_2 = 0, \quad (x_{C'})_3 = 3. \tag{12.18}$$

The first two values are the abscissas of the initial points $A$ and $B$; the value $x_{C'} = 3$ is the abscissa of the intersection point of the line $AB$ and the elliptic curve. Denote this point as $C'$: $x_{C'} = 3$.

Having substituted $x_{C'}$ into the equation $y = x + 1$, we obtain $y_{C'} = 4$, the ordinate of the point $C'$.

The coordinates of the sought point $C = A \oplus B$ are equal to

$$\begin{cases} x_C = x_{C'}, \\ y_C = -y_{C'}. \end{cases} \tag{12.19}$$

As a result, $x_C = 3$, $y_C = -4$; therefore, on the curve $y^2 = x^3 - 4x + 1$ the following equality is valid

$$(-2, -1) \oplus (0, 1) = (3, -4). \tag{12.20}$$
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Note that the use of the relations (12.9) also leads to the desired result. 


Example 12.3 Compute the sum $(-2, 3) \oplus (3, -3)$ of two points on the elliptic curve specified by the equation $y^2 = x^3 - 7x + 3$.


Solution Verify that the points $(-2, 3)$ and $(3, -3)$ belong to the curve.

The first point: $3^2 = (-2)^3 - 7 \cdot (-2) + 3$, or $9 = 9$ is true;

the second point: $(-3)^2 = 3^3 - 7 \cdot 3 + 3$, or $9 = 9$ is true.

In order to compute the sum, use the formulae (12.9), into which the values $x_A = -2$, $y_A = 3$, $x_B = 3$, $y_B = -3$ should be substituted:

$$\varkappa = \frac{-3 - 3}{3 - (-2)} = -\frac{6}{5}, \tag{12.21}$$
$$x_C = \left(-\frac{6}{5}\right)^2 - (-2) - 3 = \frac{11}{25}, \tag{12.22}$$
$$y_C = -3 + \left(-\frac{6}{5}\right)\left(-2 - \frac{11}{25}\right) = -\frac{9}{125}. \tag{12.23}$$

So, by the algebraic method, we have obtained the equality $(-2, 3) \oplus (3, -3) = \left(\dfrac{11}{25}, -\dfrac{9}{125}\right)$.
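The relations (12.9) translate directly into exact rational arithmetic. The sketch below uses Python's standard `fractions.Fraction` type, which is a choice made for this illustration; the function name `add_distinct` is likewise an assumption. It reproduces the results of Examples 12.2 and 12.3.

```python
from fractions import Fraction


def add_distinct(A, B):
    """Sum of two curve points with different abscissas, relations (12.9).
    The curve coefficients are not needed in this case: the slope of the
    chord AB determines the third intersection point."""
    (xa, ya), (xb, yb) = A, B
    k = Fraction(yb - ya) / Fraction(xb - xa)   # slope of the line AB
    xc = k * k - xa - xb
    yc = -ya + k * (xa - xc)
    return xc, yc


c1 = add_distinct((-2, -1), (0, 1))    # Example 12.2, curve y^2 = x^3 - 4x + 1
c2 = add_distinct((-2, 3), (3, -3))    # Example 12.3, curve y^2 = x^3 - 7x + 3
print(c1, c2)
```

Because all operations are performed on exact fractions, the results can be compared with the hand computation without rounding concerns.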


For the sum of the points of the form $C = \underbrace{A \oplus A \oplus A \oplus \dots \oplus A}_{n \text{ times}}$ the designation $C = nA$ is used.
For $n < 0$ the point $nA$ is defined as the element opposite to $(-n)A$.
Multiplication by zero results in the point at infinity $O$.
Thus, the point $nA$, which is the sum of $n$ copies of the point $A$, is defined for all integral $n$ in accordance with the following rule:

$$nA = \begin{cases} \underbrace{A \oplus A \oplus A \oplus \dots \oplus A}_{n \text{ times}}, & \text{if } n > 0, \\ O, & \text{if } n = 0, \\ -(|n|A), & \text{if } n < 0. \end{cases} \tag{12.24}$$

Example 12.4 Compute the point $4(5, 11)$, i.e. the sum of four copies of the point $(5, 11)$, on the elliptic curve defined by the equation $y^2 = x^3 - 2x + 6$.


Solution Verify that the point $(5, 11)$ belongs to the elliptic curve:

$$11^2 = 5^3 - 2 \cdot 5 + 6, \text{ or } 121 = 121 \text{ is true}. \tag{12.25}$$

Use the equality

$$4(5, 11) = (5, 11) \oplus (5, 11) \oplus (5, 11) \oplus (5, 11) = 2(5, 11) \oplus 2(5, 11). \tag{12.26}$$

By the duplication formulae (12.12) we obtain

$$\varkappa = \frac{73}{22} \;\Rightarrow\; 2(5, 11) = (489/484,\ 23835/10648). \tag{12.27}$$

Applying the duplication formula again and using a program in Python for computing, we arrive at the final answer:

$$4(5, 11) = (-2\,160\,508\,643\,999/1\,099\,855\,587\,600,\ -1\,767\,794\,172\,358\,992\,751/1\,153\,462\,548\,939\,624\,000). \tag{12.28}$$

As is seen from the considered example, the arithmetic operations with the elliptic curve points are in many cases very labour-intensive, and it is almost impossible to obtain the result without the use of computing systems.
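The repeated duplication of Example 12.4 can be reproduced with exact rational arithmetic. This is a minimal sketch based on the duplication formulae (12.12); it uses the standard `fractions.Fraction` type rather than the classes of Listing 12.1, and the function name `double` is an assumption.

```python
from fractions import Fraction


def double(P, a):
    """Duplication of a point P = (x, y) with y != 0 on y^2 = x^3 + a*x + b,
    following the formulae (12.12)."""
    x, y = P
    k = (3 * x * x + a) / (2 * y)    # slope of the tangent at P
    xc = k * k - 2 * x
    yc = -y + k * (x - xc)
    return xc, yc


a = Fraction(-2)                      # curve y^2 = x^3 - 2x + 6
P = (Fraction(5), Fraction(11))
P2 = double(P, a)                     # 2(5, 11)
P4 = double(P2, a)                    # 4(5, 11) = 2(5, 11) + 2(5, 11)
print(P2)
print(P4)
```

A useful sanity check on such long fractions is to substitute the result back into the curve equation, which must hold exactly.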


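Theorem 12.2 The operation of addition of the points defined on the elliptic curve $\Gamma$ on the set $\mathbb{R}^2 \cup \{O\}$ has the following properties:

1. $A \oplus O = O \oplus A = A \quad \forall A \in \Gamma$;
2. $A \oplus (-A) = O \quad \forall A \in \Gamma$;
3. $A \oplus B = B \oplus A \quad \forall A, B \in \Gamma$;
4. $(A \oplus B) \oplus C = A \oplus (B \oplus C) \quad \forall A, B, C \in \Gamma$.

The result of the operation "$\oplus$" belongs to the same set $\mathbb{R}^2 \cup \{O\}$.


In other words, the operation "$\oplus$" is commutative and associative. The role of the opposite to $A$ is played by $-A$; the role of zero (neutral element) is played by the point $O = \infty$.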


12.2 Elliptic Curves with Rational Points 


In the previous sections of this chapter, a Cartesian plane completed with a point at infinity $\infty$ was considered as the universal set:

$$U = \mathbb{R}^2 \cup \{O\}. \tag{12.29}$$

Recall that the universal set, by definition, contains the entire set of values that the variables can take in a certain problem [41]. For cryptography and number theory, the most significant are the properties of the elliptic curves on the set of points with rational coordinates $\mathbb{Q}^2 \cup \{\infty\}$, where $\mathbb{Q} = \{p/q : p, q \in \mathbb{Z},\ q \ne 0\}$ is the set of rational numbers. As in the case of a standard Cartesian plane, an additional point $\infty$ is introduced, which is considered to be rational.

So, let us fix as the universal set

$$U = \mathbb{Q}^2 \cup \{\infty\}. \tag{12.30}$$
The operation of addition of points is defined by the same rule as in the previous section. Formulate this rule in the form of an algorithm.


Algorithm of addition of points on the elliptic curve The sum of the two points $A(x_A, y_A)$ and $B(x_B, y_B)$ on the elliptic curve

$$\mathcal{E} = \{(x, y) \in \mathbb{Q}^2 : y^2 = x^3 + ax + b\} \cup \{O\} \tag{12.31}$$

is the point $C(x_C, y_C)$, whose coordinates are found by the following rule.

1. If $A = O$, then $C = A \oplus B = B$.
2. If $B = O$, then $C = A \oplus B = A$.
3. If $x_A = x_B$ and $y_A = -y_B$, then $C = A \oplus B = O$.
4. In other cases, the parameter $\varkappa$ is computed:

$$\varkappa = \begin{cases} \dfrac{y_B - y_A}{x_B - x_A}, & \text{if } A \ne B, \\[3mm] \dfrac{3x_A^2 + a}{2y_A}, & \text{if } A = B, \end{cases} \tag{12.32}$$

and the coordinates of the point $C$ will be equal to

$$x_C = \varkappa^2 - x_A - x_B, \tag{12.33}$$

$$y_C = \varkappa(x_A - x_C) - y_A. \tag{12.34}$$


Here, the procedure of addition of the points is well defined, since it does not take the result beyond the extended set of points with rational coordinates. Indeed, all the usual arithmetic operations (addition, subtraction, multiplication and division) are performed in the algorithm on rational operands, which eventually results in rational numbers. Division by zero cannot take place in the algorithm owing to the preliminary processing of the exceptional case $B = -A$. Therefore, a theorem similar to Theorem 12.2 holds.
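The four steps of the algorithm can be sketched compactly with exact rational arithmetic. The sketch below is not Listing 12.1: it relies on the standard `fractions.Fraction` type and represents the point $O$ by `None`, both of which are choices made for this illustration; the names `add` and `pt` are assumptions as well.

```python
from fractions import Fraction

O = None  # point at infinity (representation chosen for this sketch)


def add(A, B, a):
    """Sum of two points of the curve y^2 = x^3 + a*x + b over the rationals,
    following steps 1-4 of the algorithm and formulae (12.32)-(12.34)."""
    if A is O:                          # step 1
        return B
    if B is O:                          # step 2
        return A
    (xa, ya), (xb, yb) = A, B
    if xa == xb and ya == -yb:          # step 3: opposite points
        return O
    if A == B:                          # step 4: tangent slope
        k = (3 * xa * xa + a) / (2 * ya)
    else:                               # step 4: chord slope
        k = (yb - ya) / (xb - xa)
    xc = k * k - xa - xb
    yc = k * (xa - xc) - ya
    return xc, yc


def pt(x, y):
    """Helper: build a point with exact rational coordinates."""
    return Fraction(x), Fraction(y)


a = Fraction(-4)                        # curve y^2 = x^3 - 4x + 1 of Example 12.2
A, B, C = pt(-2, -1), pt(0, 1), pt(3, 4)
print(add(A, B, a))                     # the sum computed in Example 12.2
print(add(add(A, B, a), C, a))          # A, B and C are collinear, so the sum is O
```

The last line also illustrates Theorem 12.1: the points $A$, $B$ and $C = (3, 4)$ lie on the line $y = x + 1$, so their sum is the point at infinity.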


Theorem 12.3 The operation of addition of the points defined on the elliptic curve $\mathcal{E}$ on the set $\mathbb{Q}^2 \cup \{O\}$ has the following properties:

1. $A \oplus O = O \oplus A = A \quad \forall A \in \mathcal{E}$;
2. $A \oplus (-A) = O \quad \forall A \in \mathcal{E}$;
3. $A \oplus B = B \oplus A \quad \forall A, B \in \mathcal{E}$;
4. $(A \oplus B) \oplus C = A \oplus (B \oplus C) \quad \forall A, B, C \in \mathcal{E}$.

The result of the operation "$\oplus$" belongs to the same extended set of points with rational coordinates.


As we can see, addition of rational points on an arbitrary elliptic curve has all the properties of the regular operation of addition of rational numbers.
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12.3 Implementation of the Addition Algorithm 


Let us consider a software implementation of the algorithm of addition of rational points on an elliptic curve. The program's input data are the coordinates of the points $A$ and $B$; the coordinates of the sum $A \oplus B$ on the set $\mathbb{Q}^2 \cup \{O\}$ are written to the resulting file.

Listing 12.1


    class RationalFraction(object):
        def __init__(self, n, d):
            self.n = n
            self.d = d

        def __eq__(self, other):
            return self.n == other.n and \
                   self.d == other.d

        def __neg__(self):
            return RationalFraction(-self.n, self.d)


    class RationalPoint(object):
        def __init__(self, x, y):
            self.x = x
            self.y = y

        def __eq__(self, other):
            return self.x == other.x and \
                   self.y == other.y


    # Denominators of coordinates of the point at infinity
    # are equated to zero
    O = RationalPoint(RationalFraction(1, 0),
                      RationalFraction(1, 0))

    # Parameter a of an elliptic curve
    a = RationalFraction(-4, 1)


    def add(p1, p2):
        if p1.x.d == 0 or p1.y.d == 0:
            return p2
        elif p2.x.d == 0 or p2.y.d == 0:
            return p1
        elif p1.x == p2.x and p1.y == (-p2.y):
            return O

        k = RationalFraction(0, 0)
        c = RationalPoint(RationalFraction(0, 0),
                          RationalFraction(0, 0))

        if p1 == p2:
            # Points coincide: k = (3 * x1^2 + a) / (2 * y1)
            k.n = p1.y.d * \
                  (3 * p1.x.n * p1.x.n * a.d +
                   a.n * p1.x.d * p1.x.d)
            k.d = 2 * p1.y.n * p1.x.d * \
                  p1.x.d * a.d
        else:
            # Points are different: k = (y2 - y1) / (x2 - x1)
            k.n = p1.x.d * p2.x.d * \
                  (p2.y.n * p1.y.d - p1.y.n * p2.y.d)
            k.d = p1.y.d * p2.y.d * \
                  (p2.x.n * p1.x.d - p1.x.n * p2.x.d)

        if k.d < 0:
            k.n = -k.n
            k.d = -k.d

        # The remaining steps are the same in both cases
        # xc = k^2 - x1 - x2
        c.x.n = \
            (k.n * k.n * p1.x.d * p2.x.d -
             p1.x.n * k.d * k.d * p2.x.d -
             p2.x.n * k.d * k.d * p1.x.d)
        c.x.d = k.d * k.d * p1.x.d * p2.x.d

        # yc = k * (x1 - xc) - y1
        c.y.n = \
            k.n * p1.y.d * \
            (p1.x.n * c.x.d - c.x.n * p1.x.d) - \
            k.d * p1.x.d * c.x.d * p1.y.n
        c.y.d = k.d * p1.x.d * c.x.d * p1.y.d

        fraction_reduce(c.x)
        fraction_reduce(c.y)

        # Keep the denominators positive
        if c.x.d < 0:
            c.x.n = -c.x.n
            c.x.d = -c.x.d

        if c.y.d < 0:
            c.y.n = -c.y.n
            c.y.d = -c.y.d

        return c


    def gcd(a, b):
        r = 0

        # Euclid's algorithm
        while b != 0:
            r = a % b
            a = b
            b = r

        return a


    def fraction_reduce(fraction):
        temp = gcd(fraction.n, fraction.d)

        fraction.n //= temp
        fraction.d //= temp


    # Placeholder values; they are overwritten by the input data
    p1 = RationalPoint(RationalFraction(0, 0),
                       RationalFraction(0, 0))
    p2 = RationalPoint(RationalFraction(0, 0),
                       RationalFraction(0, 0))

    with open('input.txt') as file:
        p1.x.n, p1.x.d, p1.y.n, p1.y.d = \
            [int(num) for num in next(file).split()]

        p2.x.n, p2.x.d, p2.y.n, p2.y.d = \
            [int(num) for num in next(file).split()]

    result = add(p1, p2)

    with open('output.txt', 'w') as file:
        output = '(%d / %d, %d / %d)' % \
                 (result.x.n, result.x.d,
                  result.y.n, result.y.d)

        file.write(output)


For operations with rational fractions, the program implements a class
RationalFraction, which consists of two fields: the numerator n and the
denominator d of the fraction.

An arbitrary rational point of the plane is characterized by its abscissa and
ordinate. Each of these coordinates is a rational fraction; this is why the
description of the class RationalPoint includes two fields, x and y, that store
RationalFraction objects.

Thus, a rational point of the elliptic curve is represented in the program by an
object of a class whose fields are objects of another class.
For example, the point $P_1(3, -2)$ of the Cartesian plane can be written in the
form $\left(\frac{3}{1}, -\frac{2}{1}\right)$, that is,

    RationalPoint(RationalFraction(3, 1), RationalFraction(-2, 1))

and be represented in the memory of the computer as

    p1 = {{3, 1}, {-2, 1}}.

The fields are referred to as follows:

* p1.x.n: numerator of the abscissa of the point $P_1$;
* p1.x.d: denominator of the abscissa of the point $P_1$;
* p1.y.n: numerator of the ordinate of the point $P_1$;
* p1.y.d: denominator of the ordinate of the point $P_1$.


The sign of a fraction is carried by its numerator, while in intermediate
computations the denominator is kept positive.

We should separately discuss the issue of representing the point at infinity O. In
this implementation, it is introduced as the object

    O = RationalPoint(RationalFraction(1, 0), RationalFraction(1, 0))

i.e. the denominators of its coordinates are equated to zero.

The main procedure that performs the addition of the points is called add().
Its code directly follows the algorithm of addition of points presented on
page 393, and the arithmetic operations on rational fractions are executed
separately for the numerator and for the denominator.

In particular, addition of two fractions is presented in the form:

$$\frac{\mathtt{p1.x.n}}{\mathtt{p1.x.d}} + \frac{\mathtt{p2.x.n}}{\mathtt{p2.x.d}} = \frac{\mathtt{p1.x.n} * \mathtt{p2.x.d} + \mathtt{p2.x.n} * \mathtt{p1.x.d}}{\mathtt{p1.x.d} * \mathtt{p2.x.d}}. \tag{12.35}$$

Subtraction, multiplication and division are implemented in the program
similarly. Such arithmetic operations may result in a reducible fraction. For
example, the summation $\frac{2}{3} + \frac{1}{3}$ is performed as follows:

$$\frac{2}{3} + \frac{1}{3} = \frac{2 \cdot 3 + 1 \cdot 3}{3 \cdot 3} = \frac{9}{9}. \tag{12.36}$$

For this reason, prior to returning the result, the function add() divides the
numerator and the denominator of each fraction by their greatest common divisor,
gcd. For this purpose, the auxiliary function fraction_reduce() is called.
Computing the greatest common divisor of two integers is based on the widely
known Euclid's³ algorithm [32].


³ Euclid (Εὐκλείδης) (about 325 BC to before 265 BC), Ancient Greek mathematician.
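
The fraction arithmetic described above can be sketched separately from the point classes. In the following minimal example fractions are passed as plain numerator and denominator integers; the function names add_fractions and reduce_fraction are hypothetical, and the standard function math.gcd is used in place of a hand-written Euclid's algorithm. Nonzero denominators are assumed.

```python
from math import gcd


def add_fractions(n1, d1, n2, d2):
    """Return n1/d1 + n2/d2 as an unreduced (numerator, denominator) pair."""
    return n1 * d2 + n2 * d1, d1 * d2


def reduce_fraction(n, d):
    """Divide both parts by their gcd and keep the denominator positive."""
    g = gcd(n, d)          # math.gcd always returns a non-negative value
    n, d = n // g, d // g
    if d < 0:              # the sign of the fraction is carried by the numerator
        n, d = -n, -d
    return n, d


n, d = add_fractions(2, 3, 1, 3)   # the unreduced result 9/9
n, d = reduce_fraction(n, d)       # the reduced result 1/1
```

Keeping the sign in the numerator matters for the comparison of fractions field by field: a reduced fraction with a positive denominator has a unique representation.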
At the final stage, the program outputs the sum of the two rational points in the
form

    (c.x.n / c.x.d, c.y.n / c.y.d)

where c.x.n is the numerator of the abscissa of the result, c.x.d is the
denominator of the abscissa of the result, etc.


Review Questions 


1. Define an elliptic curve.
2. How is the discriminant of the equation of an elliptic curve computed?
3. Explain how the sign of the discriminant of an elliptic curve determines the
   number of connected components of its graph.
4. Enumerate the properties of the point at infinity O.
5. Explain how the sum of the points $A \oplus B$ on the elliptic curve is computed.
6. How can one, knowing the coordinates of some point on the elliptic curve, find
   the coordinates of the opposite point?
7. Formulate the necessary and sufficient condition for the sum of three points on
   the elliptic curve to be equal to O.
8. Define the sum $nA$ of points on the elliptic curve, where n is an integer.
9. Enumerate the properties of the operation of addition of points on the elliptic
   curve.


Problems 


12.1. Which of the equations listed below specify elliptic curves on the
extended Cartesian plane $\mathbb{R}^2 \cup \{O\}$?

(1) $y^2 = x^2 - x + 1$;
(2) $y^2 = x^3 + x + 1$;
(3) $y^2 = x + 1$;
(4) $y^2 = x^3$;
(5) $y^2 = x^3 + 3x + 2$;
(6) $y^2 = x^3 - 3x + 2$;
(7) $y^2 = x^3 - 15x/27$;
(8) $y^2 = x^4 + x^3$.

If the curve belongs to the class of elliptic ones, then compute its
discriminant and construct a graph.

12.2. Draw a graph of the curve $S = \{(x, y): y^2 = x^3 - 12x + 16\}$ and explain
why it does not belong to the class of elliptic curves.

12.3. Compute the discriminant for each of the following elliptic curves:

(1) $y^2 = x^3 - 2x + 3$;
(2) $y^2 = x^3 - x - 1$;
(3) $y^2 = x^3 + 4x$;
(4) $y^2 = x^3 - 10x + 8$.

How many connected components does each of their graphs contain?

*12.4. Prove that the duplication formula for computing the abscissa of the
point 2A on the curve $y^2 = x^3 + ax + b$ can be presented in the following
form:

$$x_{2A} = \frac{x_A^4 - 2ax_A^2 - 8bx_A + a^2}{4x_A^3 + 4ax_A + 4b}. \tag{12.37}$$

12.5. Verify that the points (-2, 5), (-1, -5) and (103, 1045) belong to the
elliptic curve $y^2 = x^3 - 7x + 19$.

12.6. Verify that on the curve $y^2 = x^3 + 15$ the following equality

$$2(1, 4) = (-119/64, -1499/512)$$

is valid.

12.7. Compute the following sums of the points on the elliptic curves on the set
$\mathbb{Q}$:

(1) $(-3, 3) \oplus (1, 3)$ on the curve $y^2 = x^3 - 7x + 15$;
(2) $(1, 4) \oplus (1, 4)$ on the curve $y^2 = x^3 + x + 14$.

12.8. Verify the validity of the equalities on the elliptic curve
$y^2 = x^3 - 7x + 10$:

(1) $(5, 10) \oplus (9, 26) = (2, 2)$;
(2) $(5, 10) \oplus (2, -2) = (9, -26)$;
(3) $(5, 10) \oplus (2, 2) = (1/9, 82/27)$;
(4) $(5, -10) \oplus (1, -2) = (-2, -4)$.

12.9. Compute the sum of the points

$$(-2, -4) \oplus (1, 2) \oplus (2, 2) \oplus (-3, 2)$$

on the curve $y^2 = x^3 - 7x + 10$.

12.10. Perform duplication of the point (5, 12) on the curve $y^2 = x^3 + x + 14$.

12.11. Compute the sum of the points

(1) 2(7, 19) on the curve $y^2 = x^3 + x + 11$;
(2) 3(1, 4) on the curve $y^2 = x^3 + x + 14$.

*12.12. Compute the sum of the points

$$2(2, 4) \oplus 2(33, 190) \oplus (1, 2)$$

on the curve $y^2 = x^3 + 5x - 2$.

*12.13. Compute the sum of the points

$$(6, 16) \oplus (-1, -2) \oplus (-2)(9, -28)$$

on the curve $y^2 = x^3 + 5x + 10$.

12.14. Write a function that checks whether the rational point P(x, y) belongs to
the elliptic curve $y^2 = x^3 + ax + b$.


Answers and Solutions 


12.1 Solution. 
An elliptic curve, as is known, is defined by the relation

$$y^2 = x^3 + ax + b$$

for some $a, b \in \mathbb{R}$ on the condition $4a^3 + 27b^2 \neq 0$ (see page 383).
Relying on this definition, we obtain

(1) $y^2 = x^2 - x + 1$ is not an elliptic curve, since the right side lacks the
summand $x^3$;

(2) $y^2 = x^3 + x + 1$ is an elliptic curve; the respective values of the
parameters are $a = 1$, $b = 1$, the discriminant is
$\Delta = -16(4a^3 + 27b^2) = -496$;

(3) $y^2 = x + 1$ is not an elliptic curve, since it lacks the cubic summand $x^3$;

(4) $y^2 = x^3$ is not an elliptic curve, since $a = 0$, $b = 0$ and the equality
$4a^3 + 27b^2 = 0$ is valid;

(5) $y^2 = x^3 + 3x + 2$ satisfies the definition of an elliptic curve with the
parameters $a = 3$ and $b = 2$, the discriminant is $\Delta = -3456$;

(6) $y^2 = x^3 - 3x + 2$ is not an elliptic curve, since $a = -3$, $b = 2$ and
$4a^3 + 27b^2 = 0$;

(7) $y^2 = x^3 - 5x/27$ satisfies the definition of an elliptic curve, $a = -5/27$
and $b = 0$, $\Delta = 8000/19683$;

(8) the equation $y^2 = x^4 + x^3$ includes a summand of the fourth degree $x^4$,
therefore, the respective curve does not belong to the class of elliptic curves
(Fig. 12.5).
Fig. 12.5 Elliptic curves to Problem 12.1: (a) $y^2 = x^3 + x + 1$; (b) $y^2 = x^3 + 3x + 2$; (c) $y^2 = x^3 - 15x/27$. The rectangular mesh on the panels (a)-(c) has a step equal to 2


12.2 Solution. 

The graph of the curve S is presented in Fig. 12.6. The curve S is not elliptic,
since the condition that its discriminant be nonzero is not satisfied:
$\Delta = -16(4a^3 + 27b^2) = -16(4(-12)^3 + 27 \cdot 16^2) = 0$. Geometrically,
this fact shows up as a self-intersection point of the graph with the coordinates
(2, 0): indeed, $x^3 - 12x + 16 = (x - 2)^2(x + 4)$ has the double root $x = 2$.


12.3 Solution. 
Using the formula $\Delta = -16(4a^3 + 27b^2)$, we obtain

(1) $\Delta = -3376$, the graph has one connected component;

(2) $\Delta = -368$, one connected component;

(3) $\Delta = -4096$, one connected component;

(4) $\Delta = 36352$, the graph consists of two connected components.
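
The discriminants in Solutions 12.1 to 12.3 can be cross-checked with a short sketch. The function names discriminant and classify are hypothetical; fractions.Fraction is used so that non-integer parameters such as $a = -5/27$ are handled exactly. The rule applied in classify is the one used in Solution 12.3: a positive discriminant corresponds to two connected components and a negative one to a single component.

```python
from fractions import Fraction


def discriminant(a, b):
    """Discriminant -16 * (4a^3 + 27b^2) of the curve y^2 = x^3 + a*x + b."""
    a, b = Fraction(a), Fraction(b)
    return -16 * (4 * a**3 + 27 * b**2)


def classify(a, b):
    """Describe the curve y^2 = x^3 + a*x + b by the sign of its discriminant."""
    d = discriminant(a, b)
    if d == 0:
        return "not an elliptic curve"
    if d > 0:
        return "two connected components"
    return "one connected component"


# Curves (1)-(4) of Problem 12.3 as (a, b) pairs
for a, b in [(-2, 3), (-1, -1), (4, 0), (-10, 8)]:
    print(a, b, discriminant(a, b), classify(a, b))
```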
Fig. 12.6 The curve S to Problem 12.2. The rectangular mesh has a step equal to 2. The point (2, 0) is a self-intersection point of the curve $y^2 = x^3 - 12x + 16$


12.4 Solution. 


Expanding the expression $\left(\dfrac{3x_A^2 + a}{2y_A}\right)^2 - 2x_A$ and
collecting similar summands, we arrive at the formula for $x_{2A}$:

$$x_{2A} = \frac{9x_A^4 + 6ax_A^2 + a^2}{4y_A^2} - 2x_A
= \frac{9x_A^4 + 6ax_A^2 + a^2 - 8x_A y_A^2}{4y_A^2}$$

$$= \frac{9x_A^4 + 6ax_A^2 + a^2 - 8x_A(x_A^3 + ax_A + b)}{4(x_A^3 + ax_A + b)}
= \frac{x_A^4 - 2ax_A^2 - 8bx_A + a^2}{4x_A^3 + 4ax_A + 4b}.$$

In the algebraic transformations, the equality $y_A^2 = x_A^3 + ax_A + b$ was used,
which is valid for the coordinates of all points of the elliptic curve.
Thus, the algebraic form of the duplication formula is proved.


12.5 Solution. 
Substitute the coordinates of each point into the equation of the curve
$y^2 = x^3 - 7x + 19$:

$$5^2 = (-2)^3 - 7(-2) + 19, \text{ or } 25 = 25;$$

$$(-5)^2 = (-1)^3 - 7(-1) + 19, \text{ or } 25 = 25;$$

$$1045^2 = 103^3 - 7(103) + 19, \text{ or } 1092025 = 1092025.$$

In all three cases we obtain true equalities, which is why the considered points
belong to the elliptic curve $y^2 = x^3 - 7x + 19$.


12.6 Solution. 
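Use the duplication formula (12.12). Substitute in it the numeric values from the
problem statement $a = 0$, $b = 15$, $x_A = 1$ and $y_A = 4$. Then we obtain

$$x_{2A} = \left(\frac{3 \cdot 1^2 + 0}{2 \cdot 4}\right)^2 - 2 \cdot 1 = \left(\frac{3}{8}\right)^2 - 2 = -\frac{119}{64},$$

$$y_{2A} = \frac{3}{8}\left(1 - \left(-\frac{119}{64}\right)\right) - 4 = -\frac{1499}{512}.$$

Therefore, the equality $2(1, 4) = (-119/64, -1499/512)$ is valid on the curve
$y^2 = x^3 + 15$.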


12.7 Answer: 

(1) (2, -3); 

(2) (—7/4, —21/8). 

12.9 Answer: (—2, —4). 

12.10 Answer: (1/36, 809/216). 
12.11 Answer: 

(1) (422/361, 25449/6859); 
(2) (793/121, —23132/1331). 
12.12 Answer: (33, 190). 

12.13 Answer: O. 
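
Answers that involve multiples of a point can be cross-checked numerically. The sketch below is one possible way to do this under stated assumptions: None stands for the point at infinity O, rational arithmetic is delegated to fractions.Fraction, and the names pt, ec_add, ec_neg and ec_mul are hypothetical. The multiple nA is computed by the binary "double and add" scheme, which is a substitution for adding A to itself n times.

```python
from fractions import Fraction

O = None  # assumption of this sketch: None represents the point at infinity


def pt(x, y):
    return (Fraction(x), Fraction(y))


def ec_add(A, B, a):
    """Sum of two points on y^2 = x^3 + a*x + b."""
    if A is O:
        return B
    if B is O:
        return A
    (xa, ya), (xb, yb) = A, B
    if xa == xb and ya == -yb:
        return O
    if A == B:
        k = (3 * xa * xa + a) / (2 * ya)
    else:
        k = (yb - ya) / (xb - xa)
    xc = k * k - xa - xb
    return (xc, k * (xa - xc) - ya)


def ec_neg(A):
    """Opposite point: the ordinate changes its sign."""
    return O if A is O else (A[0], -A[1])


def ec_mul(n, A, a):
    """Multiple n*A for any integer n, by repeated doubling."""
    if n < 0:
        return ec_mul(-n, ec_neg(A), a)
    result, addend = O, A
    while n > 0:
        if n & 1:
            result = ec_add(result, addend, a)
        addend = ec_add(addend, addend, a)
        n >>= 1
    return result


# Problem 12.10: duplication of (5, 12) on y^2 = x^3 + x + 14
print(ec_mul(2, pt(5, 12), Fraction(1)))
```

For Problem 12.13 the same functions give ec_add(ec_add(pt(6, 16), pt(-1, -2), 5), ec_mul(-2, pt(9, -28), 5), 5), which evaluates to O.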


12.14 Solution. 

Let us assume that the rational point P(x, y) is represented in the computing
system memory as an object of the class RationalPoint described in Listing
12.1 (see the definition of this class in the program code). The parameters of
the curve are assumed to be stored in the global RationalFraction objects a and b.

Let us show the implementation of the function is_point() that returns the
value True or False depending on whether the point P(x, y) belongs to the
elliptic curve $y^2 = x^3 + ax + b$.


    def is_point(p):
        # Point at infinity
        if p.x.d == 0 and p.y.d == 0:
            return True
        elif p.x.d == 0 or p.y.d == 0:
            raise ValueError("Zero denominator of a coordinate")

        # Checking the condition y * y = x * x * x + a * x + b:
        # temp1 / temp2 is the right-hand side as a single fraction
        temp1 = a.d * b.d * p.x.n * p.x.n * p.x.n + \
                a.n * b.d * p.x.n * p.x.d * p.x.d + \
                a.d * b.n * p.x.d * p.x.d * p.x.d
        temp2 = a.d * b.d * p.x.d * p.x.d * p.x.d
        temp3 = temp2 * p.y.n * p.y.n - temp1 * p.y.d * p.y.d

        return temp3 == 0
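
For comparison, the same membership test can be sketched with the standard class fractions.Fraction, which performs the cross-multiplication internally. This is an alternative under stated assumptions, not the solution above: the name is_on_curve and its argument order are hypothetical, the curve parameters are passed explicitly, and the point at infinity is not handled.

```python
from fractions import Fraction


def is_on_curve(x, y, a, b):
    """Check whether the finite rational point (x, y) satisfies y^2 = x^3 + a*x + b."""
    x, y, a, b = (Fraction(v) for v in (x, y, a, b))
    return y * y == x**3 + a * x + b


# The points of Problem 12.5 on the curve y^2 = x^3 - 7x + 19
checks = [is_on_curve(x, y, -7, 19) for x, y in [(-2, 5), (-1, -5), (103, 1045)]]
print(checks)
```

Because Fraction arithmetic is exact, the comparison is free of rounding errors, just as the integer comparison temp3 == 0 is.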


Appendix A 
Basic Operators in Python and C 


This book uses the Python language to write algorithms [68, 71]. Of course,
when necessary, all the algorithms presented in the text can be rewritten in
any other programming language. In the present Appendix we provide a table of
correspondences between the basic constructs of Python 3 and their analogues in
the C language (see Table A.1).

Both these languages are high-level programming languages, although the level
of abstraction in Python is considered to be higher than in the C language [50].
As a rule, this results in slower operation of programs in Python.

One of the important differences between the syntaxes of these two languages
is that commands in C end with a semicolon, while in Python it is not necessary
to put a semicolon at the end of a command. Another significant difference is
associated with marking out a block of operators: C uses braces for this
purpose, while Python uses indentation (by convention, four spaces per level).

A new class in Python can be created as follows: 


    class Point:
        def __init__(self, x, y):
            self.x = x
            self.y = y


Such a class was used in Chap. 7 for describing a point of the Cartesian plane.
This class contains the fields x and y, representing the coordinates of the point.
The class also includes a constructor, __init__, that is called when an object is
created and is used for initialization of its fields. In order to refer to the
fields, the name self is used; it denotes the current class instance, which is
automatically passed as the first argument to each method of this class.

The C language, unlike Python, is not object-oriented, and C uses structures 
instead of classes [21, 39]. 

Let us enumerate some more features of Python reflected in the listings of the 
programs. 

Table A.1 Correspondences between the basic operators in Python and C


Command or operation    Python                          C

Assignment              x = y                           x = y;

Integer variables       m, n = 10, -17                  int m = 10, n = -17;

Real variables          v = 0.005                       float v = 0.005;
                        w = -1.4                        double w = -1.4;

Logic variables         u = True                        u = 1;
                        v = False                       v = 0;

String variables        s = "String text"               const char* s = "String text";

Arrays                  arr = [1, 2, 3]                 int arr[3] = {1, 2, 3};
                        arr[2] = 7                      arr[2] = 7;

Comparison of           x != y                          x != y;
variables

Logic operations        (not A) and (B or C)            (!A) && (B || C);

Arithmetic division     m / n                           (double) m / n;

Integer division        m // n                          m / n;

Comments                # comment                       // comment
                        """ Text of multiline           /* Text of multiline
                        comment """                     comment */

Conditional operator    if a == b:                      if (a == b) {
                            # Code1                         // Code1
                        elif a == c:                    }
                            # Code2                     else if (a == c) {
                        else:                               // Code2
                            # Code3                     }
                                                        else {
                                                            // Code3
                                                        }

Ternary operator        maxv = a if a >= b else b       maxv = (a >= b) ? a : b;

for loop                for i in range(n):              for (int i = 0; i < n; i++) {
                            # Code                          // Code
                                                        }

while loop              while a == b:                   while (a == b) {
                            # Code                          // Code
                                                        }

Functions               def sm(a, b):                   int sm(int a, int b) {
                            s = a + b                       int s = a + b;
                            return s                        return s;
                                                        }

Exchange of values      a, b = b, a                     t = a; a = b; b = t;
of two variables
Python provides list comprehensions for generating lists (including
multidimensional ones):

* creating a list of n numbers filled with the values from 0 to n - 1:

      V = [i for i in range(n)]

* creating a two-dimensional list (matrix) filled with zeroes:

      A = [[0 for j in range(n)]
           for i in range(n)]

Similarly to many other languages, Python provides the means for dealing with
exceptions, which are useful for processing error situations. In order to generate
an exception, the keyword raise is used:

    raise Exception("Exception message")

In order to process the exception, the construct try-except is used:

    try:
        a = 5
        b = 0
        c = a / b
    except ZeroDivisionError as e:
        print(e)

After this code is executed, the following message will be printed to the
console:

    division by zero


Appendix B 
Trigonometric Formulae 


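In the formulae of this Appendix, unless otherwise specified, $a, b \in \mathbb{R}$ and $k, k' \in \mathbb{Z}$.

$$\sin^2 a + \cos^2 a = 1; \tag{B.1}$$

$$\tan a = \frac{\sin a}{\cos a}, \quad a \neq \frac{\pi}{2} + \pi k; \tag{B.2}$$

$$\cot a = \frac{\cos a}{\sin a}, \quad a \neq \pi k; \tag{B.3}$$

$$1 + \tan^2 a = \frac{1}{\cos^2 a}, \quad a \neq \frac{\pi}{2} + \pi k; \tag{B.4}$$

$$1 + \cot^2 a = \frac{1}{\sin^2 a}, \quad a \neq \pi k; \tag{B.5}$$

$$\sin 2a = 2 \sin a \cos a, \quad \cos 2a = \cos^2 a - \sin^2 a; \tag{B.6}$$

$$\tan 2a = \frac{2 \tan a}{1 - \tan^2 a}, \quad a \neq \frac{\pi}{4} + \frac{\pi k}{2}, \quad a \neq \frac{\pi}{2} + \pi k'; \tag{B.7}$$

$$\sin^2 \frac{a}{2} = \frac{1 - \cos a}{2}, \quad \cos^2 \frac{a}{2} = \frac{1 + \cos a}{2}; \tag{B.8}$$

$$\sin(a + b) = \sin a \cos b + \cos a \sin b; \tag{B.9}$$

$$\sin(a - b) = \sin a \cos b - \cos a \sin b; \tag{B.10}$$

$$\cos(a + b) = \cos a \cos b - \sin a \sin b; \tag{B.11}$$

$$\cos(a - b) = \cos a \cos b + \sin a \sin b; \tag{B.12}$$

$$\tan(a + b) = \frac{\tan a + \tan b}{1 - \tan a \tan b}, \quad a, b, a + b \neq \frac{\pi}{2} + \pi k; \tag{B.13}$$

$$\tan(a - b) = \frac{\tan a - \tan b}{1 + \tan a \tan b}, \quad a, b, a - b \neq \frac{\pi}{2} + \pi k; \tag{B.14}$$

$$\sin a + \sin b = 2 \sin\left(\frac{a + b}{2}\right) \cos\left(\frac{a - b}{2}\right); \tag{B.15}$$

$$\sin a - \sin b = 2 \cos\left(\frac{a + b}{2}\right) \sin\left(\frac{a - b}{2}\right); \tag{B.16}$$

$$\cos a + \cos b = 2 \cos\left(\frac{a + b}{2}\right) \cos\left(\frac{a - b}{2}\right); \tag{B.17}$$

$$\cos a - \cos b = -2 \sin\left(\frac{a + b}{2}\right) \sin\left(\frac{a - b}{2}\right); \tag{B.18}$$

$$\tan a + \tan b = \frac{\sin(a + b)}{\cos a \cos b}, \quad a, b \neq \frac{\pi}{2} + \pi k; \tag{B.19}$$

$$\cot a + \cot b = \frac{\sin(a + b)}{\sin a \sin b}, \quad a, b \neq \pi k; \tag{B.20}$$

$$\sin a \sin b = \frac{1}{2}\left(\cos(a - b) - \cos(a + b)\right); \tag{B.21}$$

$$\cos a \cos b = \frac{1}{2}\left(\cos(a - b) + \cos(a + b)\right); \tag{B.22}$$

$$\sin a \cos b = \frac{1}{2}\left(\sin(a - b) + \sin(a + b)\right). \tag{B.23}$$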


Appendix C 


The Greek Alphabet 
Α, α    alpha
Β, β    beta
Γ, γ    gamma
Δ, δ    delta
Ε, ε    epsilon
Ζ, ζ    zeta
Η, η    eta
Θ, θ    theta
Ι, ι    iota
Κ, κ    kappa
Λ, λ    lambda
Μ, μ    mu
Ν, ν    nu
Ξ, ξ    xi
Ο, ο    omicron
Π, π    pi
Ρ, ρ    rho
Σ, σ    sigma
Τ, τ    tau
Υ, υ    upsilon
Φ, φ    phi
Χ, χ    chi
Ψ, ψ    psi
Ω, ω    omega
Numbers
(0, 1)-matrix, see Matrix, binary 


A 
Abel—-Ruffini theorem, see Theorem, 
Abel-Ruffini 
Abscissa, 256 
Algebraic complement, see Cofactor 
Algebraic form of a complex number, see 
Complex number, algebraic form 
Algorithm, 12 
of addition of points on the elliptic curve, 
393 
computing the determinant, 68 
Euclid’s, 398 
Gram-Schmidt (see Gram—Schmidt 
process) 
properties, 12 
determinateness, 12 
directedness, 12 
discreteness, 12 
elementary character of steps, 12 
mass character, 12 
Roy—Warshall (see Algorithm, Warshall) 
Strassen, 11 
Warshall, 18, 20, 29 
modified, 29 
Alphabet 
Greek, 413 
Latin, 1
Altitude of a triangle, 291 
Angle between a line and a plane, 325 
Angle between lines, 281, 323 
adjacent, 281 
Angle between planes, 312
Annihilating polynomial, see Polynomial, 
annihilating 
Anomaly of eccentricity, 360 
Anti-Hermitian matrix, see Matrix, anti- 
Hermitian 
Applicate, 256 
Argand diagram, 174 
Argument of a complex number, 175 
principal value, 175 
Array, 15 
two-dimensional, 15 
Associativity of addition, 6 
Associativity of multiplication, 8 
Asymptotes to a hyperbola, 361 
Asymptotic complexity, 14, 184 
of Gaussian elimination, 137 


B 

Backward pass, 137 

Basic minor, see Minor, basic 

Basic minor theorem, see Theorem, basic 

minor 

Basic operations, 13, 14 

Basis expansion of a vector, 257 

Basis states, 184 

Basis step, 53 

Big O notation, 14 

Bilinear form, 335 
polar, 338 
positively definite, 336 
properties of linearity, 335 
symmetric, 336 

Bisector of a triangle, 291 
Bit, 183
Block matrix, 186 
Bordering minor method, see Method, 


bordering minor 
Branch of a hyperbola, 361, 369 


C
Canonical equation of a line, 321 
Canonical form of a quadratic form, 339 
Canonical form of equation, 178 
Canonical form of the equation of curve of the 
second order, 357 
Canonical system of coordinates, see 
Coordinate system, canonical 
Cardinality, 19 
Cauchy—Bunyakovsky inequality, see 
Inequality, Cauchy—Bunyakovsky 
Cauchy—Schwarz inequality, see Inequality, 
Cauchy—Bunyakovsky 
Cayley—Hamilton theorem, see Theorem, 
Cayley—Hamilton 
Central curves, 362 
Characteristic equation, 226 
Characteristic polynomial, see Polynomial, 
characteristic 
Circle, 357 
class O(g(n)), 14 
Coefficients of a linear combination, 61 
Coefficients of a system, 123 
Coefficients of the straight line equation, 278 
Cofactor, 46 
Collinearity, 223 
Collinear vectors, 255 
Column of a matrix, 1
Columns 
linearly dependent, 61 
linearly independent, 61 
Commutativity of addition, 6 
Commutator, 9 
Complementary minor of the element, 46 
Complex number, 173-178 
algebraic form, 174 
exponential form, 176 
imaginary part, 173 
purely imaginary, 174 
real part, 173 
trigonometric form, 174, 175 
Components of a vector, 217 
Composition of operators, 187 
Computational basis states, 184 
Condition of belonging of lines to the same 
plane, 326 


Index 


Condition of intersection of lines, 326 
Condition of line’s belonging to the plane, 325 
Condition of orthogonality of planes, 312 
Condition of parallelism of a line and a plane, 
325 
Condition of parallelism of planes, 312 
Condition of parallelism of two lines, 326 
Condition of perpendicularity of a line and a 
plane, 325 

Conjugate of the complex number, 174 
Conjunction, 20, 21 
Constant term, 357 
Controlled NOT, 189 
Controlled phase element, 189 
Coordinates of a vector, 217 
Coordinate system 

canonical, 357 

Cartesian, 174, 256, 277 

polar, 174 
Coplanar vectors, 264, 311 
Corner minor, see Minor, corner 
C, programming language, 2, 15 
Cramer’s rule, see Method, Cramer 
Cross-diagonal, see Secondary diagonal 
Cross product, see Vector product 
Cryptography, 384 
Curves of the second order, 357 

degenerate, 363 

non-degenerate, 363 
Cusp, 384 


D 
Degree of polynomial, 178 
Determinant, 43 
of the first order, 45 
Laplace expansion, 47 
of the n-th order, 45, 46 
of the second order, 43 
of the third order, 43, 44 
Vandermonde, 67 
Determinant of a system of vectors, 221 
Deviation of a point from a plane, 313 
Deviation of a point from line, 285 
Diagonalization of a matrix, 230 
Diagonal matrix, see Matrix, square, diagonal 
Difference of matrices, 6 
Difference of vectors, 218 
Digraph, see Graph, directed 
Dirac matrices, 185, 186 
Dirac notation, 183 
Directing vector, see Vector, directing 
Direction cosines, 310 
Directrix
of an ellipse, 358 
of a hyperbola, 361 
of a parabola, 362 
Disjunction, 21 
Distance between the points, 259 
Distance from a point to a plane, 313 
Distributivity of multiplication with respect to 
addition, 8 
Dot product, see Scalar product 
Duplication formula, see Formula, of 
duplication 
Duplication of a point, 388 


E 
Eccentricity 
of a hyperbola, 361 

Eccentricity of an ellipse, 358 

Echelon matrix, see Matrix, echelon 

Edge, 19 

Eigenvalue, 226 

Eigenvector, 226 

Element π/8, 188

Elementary transformations, 49, 61, 129 

Elements of a matrix, 1

Ellipse, 357 

Elliptic curve, 383 

Endpoints, 19 

Endpoints of a segment, 287 

Equal complex numbers, 173 

Equality of matrices, 4 

Equal vectors, 217, 255 

Equation of a line that passes through the two 
points, 323 

Equation of a plane that is orthogonal to the 
vector, 307 

Equation of a plane that passes through the 
three points, 312 

Equation of a straight line through two given 
points, 280 

Equation of curve of the second order, 357 

Euclidean space, 224 

Euclid’s algorithm, see Algorithm, Euclid’s 

Euler’s formula, see Formula, Euler’s for 
complex numbers 

Euler’s identity for Pauli matrices, 200 

Evolution of the quantum system, 184 

Expansion of the vector in the basis, 222 

Exponential form of a complex number, see 
Complex number, exponential form 

Exponent of a matrix, 57 
F
Fermat last theorem, see Theorem, Fermat last 
Focal distance of an ellipse, 358 
Focal parameter of a parabola, 362 
Focus 
of an ellipse, 357 
of a hyperbola, 361 
of a parabola, 362 
Formula 
Cardano, 179 
de Moivre’s, 176, 195 
of duplication, 400 
Euler’s for complex numbers, 176 
matrix product inversion, 71 
Fortran, programming language, 15 
Forward pass, 136 
Free unknown, 138 
Function 
exponential, 176 
fractionally rational, 57 
time-complexity, 14 
trigonometric, 176 
Fundamental system of solutions, 138, 139 
Fundamental theorem of algebra, see Theorem, 
fundamental of algebra 


G 
Gate, 184, 187 
Gaussian elimination, see Method, Gauss 
General equation of a plane, 308 
complete, 308 
incomplete, 308 
General equation of a straight line on a plane, 
278 
Gram-Schmidt process, 224 
Graph, 19 
directed, 18, 19, 29 
undirected, 19 
Graph diagram, 19 
Graph theory, 19 


H 

Hadamard element, 188 

Hadamard gate, 188 

Hermitian conjugate matrix, see Matrix, 
Hermitian conjugate 

Hermitian matrix, see Matrix, Hermitian 

Hesse normal form, see Normal form of the 
equation of the line 

High-performance computing, 11
Hoare triple, 13
Hyperbola, 360 
equilateral, 369 


I 
Identity permutation, see Permutation, identity 
Identity transformation, 188 
Imaginary part of a complex number, see 
Complex number, imaginary part 
Imaginary unit, 173 
Incident vertex and edge, 19 
Incomplete gamma function, 69 
Inductive step, 53 
Inequality 
Cauchy—Bunyakovsky, 232 
triangle (see Triangle inequality) 
Inner product, see Scalar product 
Intercept form of the equation of a plane, 309 


Intercept form of the equation of a straight line, 


283 
Inversion, 45, 65 


J 
Jacobi identity 
for commutators, 10 
for vectors, 265 
Jacobi method, see Method, Jacobi 
Java, programming language, 15 


K 

Known terms, 123 

Kronecker—Capelli theorem, see Theorem, 
Kronecker—Capelli 

Kronecker symbol, 4 


L 

Lagrange’s identity, 195, 265 

Lagrange’s method, see Method, Lagrange’s 

Laplace expansion, 47 

Law of inertia, 344 

Length of a vector, 224 

Linear combination of rows, 61 

Linear combination of vectors, 218 

Linearly dependent system of vectors, 219 

Linearly independent system of vectors, 218 

Linear term, 357 

List, 15 

Logarithm of a matrix, 58 

Lower triangular matrix, see Matrix, lower 
triangular 
M
Magnitude of a vector, 255 
Main diagonal, 3 
Matrix, 1
adjacency, 19 
adjugate, 50 
anti-Hermitian, 198 
of bilinear form, 335 
binary, 5 
brief record, 1
classical adjoint (see Matrix, adjugate) 
cofactor, 50 
complex, 2 
echelon, 61 
functional, 11 
Hermitian, 181 
Hermitian conjugate, 180 
Hilbert, 71 
inverse, 50, 126 
lower triangular, 4, 51 
notations, 2 
of quadratic form, 337 
reachability, 20, 29 
real, 2 
rectangular, 2 
self-adjoint (see Matrix, Hermitian) 
square, 2 
antisymmetric, 4 
degenerate (see Matrix, square, 
singular) 
diagonal, 3, 52 
identity, 3, 50 
nondegenerate (see Matrix, square, 
nonsingular) 
nonsingular, 50 
null (see Matrix, square, zero) 
singular, 50 
symmetric, 4 
unit (see Matrix, square, identity) 
zero, 3 
transposed, 3, 4 
unitary, 182 
upper triangular, 4, 51 
Matrix algebra, 5 
Matrix form of a system, 124 
Matrix of a system, 124 
augmented, 124, 129 
Matrix of a system of vectors, 221 
Matrix power theorem, see Theorem, matrix 
power 
Matrix, skew-Hermitian, see Matrix, 
anti-Hermitian 
Measurement of state, 184 
Median of a triangle, 291 




Method 

bordering minor, 64 

Cramer, 125 

elementary transformations, 61 

of elimination of unknowns (see Method, 

Gauss) 

Gauss, 129 

Gauss—Jordan, 150 

of an inverse matrix, 126 

Jacobi, 343 

Lagrange’s, 340 

mathematical induction, 21, 53, 195 
Minkowski inequality, see Triangle inequality 
Minor, 59 
basic, 60 
corner, 343 
Mixed product, see Scalar triple product 
Modulus of a complex number, 175 
Modulus of a vector, 255 


N 
n-dimensional space, 218 
Normal, see Normal vector 
Normal equation of a plane, 310 
Normal form of the equation of the line, 284 
Normalized vector, 220, 255, 257 
Normalizing factor, 285 
Normal vector, 278 
Norm of a vector, 224 
properties, 224 
Null vector, see Zero vector 
Numbers 

complex, 173 

ratio, 192 

Fibonacci, 67 
Number theory, 384 
NumPy, library, 18 
n-vector, see Vector 


O
Opposite point, 386 
Opposite vector, 218 
Optical property 

of an ellipse, 359 

of hyperbola, 361 

of a parabola, 363 
Ordinate, 256 
Orthogonality, 223, 224, 258 
Orthonormality, 224 


P 
Parallelepiped, 263, 264 
Parameter of a line, 321 
Parameter of a parabola, see Focal parameter
of a parabola 
Parameter of a segment, 287 
Parametric equation of a line, 321 
Parametric form of an ellipse, 359 
Pascal, programming language, 15 
Path, 19 
trivial, 19 
Pauli element, 188 
Pauli matrices, 185 
Pencil of planes, 313 
Permutation, 45, 65 
identity, 45 
Phase element, 188 
Pivot element, 136 
Pivot row, 61 
Plane, 277 
complex, 174 
Point at infinity, 384 
Polynomial, 45, 56, 178 
annihilating, 230 
characteristic, 227 
of a matrix, 56 
Position vector, 174, 176, 256 
Postcondition, 13 
Precondition, 13 
Predicate, 53 
Principle of mathematical induction, see 
Method, mathematical induction 
Product of a number and a matrix, 6 
Product of a number and a vector, 217 
Product of complex numbers, 173 
Product of matrices, 7 
Projection of a point on the line, 256 
Projection of a vector on the line, 256 
Projection on |0) and |1), 188 
Proof of the algorithm correctness, 12 
Properties of determinants, 48 
Property of identity matrix, 8 
Property of zero matrix, 8 
Pythagorean theorem, see Theorem, 
Pythagorean 
Python, programming language, 2, 15, 68, 71,
135


Q 
Quadratic form, 337 
alternating, 344 
degenerate, 338 
negatively definite, 344 
nondegenerate, 338 
positively definite, 344 
Quadratic term, 357 
Quantum computer, 183 
Quantum-mechanical operator, 186 
Quantum system, 183 
Qubit, 183 


R 
Rank, 60, 135 
Rank of a quadratic form, 338, 343 
Real part of a complex number, see Complex 
number, real part 
Right-hand side of equations, see Known terms 
Root 
of equation, 178 
n-th of complex number, 176 
of unity, 177, 194, 195 
Rouché–Capelli theorem, see Theorem, 
Kronecker–Capelli 
Route, 19 
Row of a matrix, 1 
Rows 
linearly dependent, 61 
linearly independent, 61 
Roy–Warshall algorithm, see Algorithm, 
Warshall 


S 
Scalar, 1 
Scalar product, 223 
properties, 223 
Scalar triple product, 263 
properties, 263 
Secondary diagonal, 3 
Secular equation, 230 
Secular motion, 230 
Segment, 287 
Self-adjoint matrix, see Matrix, Hermitian 
Self-intersection, 384 
Semiaxis 
ellipse 
major, 357 
hyperbola 
imaginary, 360 
real, 360 
Sequence 
Fibonacci (see Numbers, Fibonacci) 
Series, 57 
Set 
of complex numbers, 173 
of real numbers, 173 
Signature of a quadratic form, 344 
Similarity transformation, 225 
Similar matrices, 225 
Skew-Hermitian matrix, see Matrix, 
anti-Hermitian 
Skew lines, 326 
Slope-intercept form of the equation of a 
straight line, 277 
Slope-intercept form of the equation of a 
straight line passing through the 
given point, 279 
Slope of a straight line, 277 
Solution of the system of equations, 123 
trivial, 138 
Space 
n-dimensional, 218 
one-dimensional, 218 
three-dimensional, 218 
two-dimensional, 218 
Spur, see Trace 
Strassen algorithm, see Algorithm, Strassen 
Sum of complex numbers, 173 
Sum of matrices, 5 
Sum of vectors, 217, 255 
Sylvester’s criterion, 344 
System of linear equations, 123 
consistent, 123 
determined, 123, 125 
homogeneous, 123, 140, 226 
inconsistent, 123 
non-homogeneous, 123 
rectangular, 123 
square, 123, 124 
undetermined, 123 


T 
Tangent, 359, 365 
to an ellipse, 359 
to a hyperbola, 361 
to a parabola, 363 
Tetrahedron, 264, 315 
Theorem 
Abel-Ruffini, 178 
basic minor, 61 
Cayley–Hamilton, 230 
Fermat's last, 384 
fundamental of algebra, 178 
Kronecker–Capelli, 129, 220 
Laplace, 47 
matrix power, 56 
Pythagorean, 233 
Rouché–Capelli (see Theorem, Kronecker– 
Capelli) 
Time-complexity function, see Function, 
time-complexity 
Trace, 11, 31 
of the unit matrix, 11 
Transposition, 48 
Triangle inequality, 193, 224 
Triangle rule, 44 
Trigonometric form of a complex number, see 
Complex number, trigonometric form 
Trigonometric formulae, 411 


U 
Unitary matrix, see Matrix, unitary 
Unit vector, 255 
Upper triangular matrix, see Matrix, upper 
triangular 


V 
Vector, 1, 217, 255 
directing, 321 
Vector equation of a plane, 307 
Vector product, 260 
properties, 260 
Vector triple product, 265 
Vertex, 19 
of an ellipse, 357 
of a hyperbola, 361 
of a parabola, 362 
Viète formulae, 195 


W 
Warshall algorithm, see Algorithm, Warshall 


Z 
Zero equation, 129 
Zero vector, 218, 220, 255 


